PREFACE


Chaos and nonlinear dynamics can be expected to play an increasingly important role in
atmospheric research and applications to Army problems. This report is intended to give a
survey of the whole field of nonlinear dynamics (or “chaos theory,” as it is popularly called)
in a compressed form. It is slightly expanded from a series of lectures given over the space
of a single month in 1989. This young and rapidly growing field is already very extensive,
so that this survey cannot be deep or detailed. In particular, no pretense of mathematical
rigor is made. But I do insist on stating key definitions or theorems carefully so that
the reader need not settle for just a qualitative, intuitive understanding. My intention is
to touch on the main ideas so that the reader can see if his or her special discipline fits
in anywhere and, if so, can get an approximate notion of what new ideas or possibilities
nonlinear dynamics brings to that field. The cited literature then allows the reader to
proceed further if he or she desires.

I thank Harry Auvermann for suggesting that I give these lectures in the first place, for
inviting  me  then  to  write  them  up  in  the  present  more  coherent  form,  and  for  taking  care 
of  all  the  official,  administrative,  and  practical  details.  My  especial  thanks  go  to  David 
Bandelier  who  worked  with  me  to  turn  my  manuscript  and  rough  sketches  into  print.  It 
is  a  great  comfort  to  see  one’s  handwritten  manuscript  come  back  promptly,  transformed 
into  practically  flawless  typescript  on  the  first  draft. 
1. INTRODUCTORY REMARKS
1.1  Linear  Versus  Nonlinear 

A dynamics describes the time evolution of a system. As such, the concept is not
confined  to  physics,  but  occurs  in  many  other  fields  as  well  -  in  related  sciences  like 
engineering,  chemistry,  and  biology,  but  also  in  ecology,  economics,  etc.  A  nonlinear 
dynamics  describes  the  time-evolution  via  nonlinear  equations  of  motion,  which  may  be 
ordinary  differential  equations,  partial  differential  equations,  difference  equations,  iteration 
of  maps,  etc.  Nonlinear  motion  equations  have  been  around  a  long  time  -  since  the 
beginning  of  science,  in  fact  -  so  why  the  sudden  blooming  of  nonlinear  dynamics  as  a  new 
discipline  in  the  last  20  years  or  so? 

The  answer  is  that  up  to  that  time  nonlinear  equations  were  regarded  as  not 
essentially  different  from  linear  ones  -  more  complicated  and  difficult  to  solve,  of  course, 
but  nothing  that  suitably  refined  linear  approximations  couldn’t  handle.  Analytic 
(“closed  form”)  solutions  were  emphasized  in  textbooks  with  the  confident  expectation 
that  “nonanalytic”  solutions,  if  they  existed,  formed  a  small  subset  of  all  solutions  which 
didn’t  greatly  add  to  the  understanding  of  the  phenomena.  But  about  20  years  ago  it 
was  realized  that  nonlinear  equations  are  essentially  different  from  linear  ones,  that  they 
possess  properties  which  can  never  be  captured  by  linear  approximations,  that  analytic 
solutions are the exception, not the rule, and that solution sets may show “deterministic
chaos.” 

Linear  equations  enjoy  by  definition  the  property  of  superposition.  That  is,  linear 
combinations  of  solutions  are  also  solutions:  the  solutions  form  a  linear,  or  vector,  space. 
Linear  theories  are  highly  structured  theories,  and  one  has  many  helpful  theorems  at  hand. 
For  example,  a  general  solution  exists,  solutions  have  only  “fixed”  singularities,  that  is, 
those  occurring  in  the  linear  equations  themselves.  But  do  not  get  the  idea  that  linear 
theories are considered passé or discredited, now that we are elucidating the mysteries
of  nonlinear  dynamics.  Some  of  the  most  beautiful  and  accurate  theories  in  physics  are 
linear.  Witness  the  Maxwell  theory  of  electromagnetism,  or  quantum  mechanics  itself, 
the  fundamental  theory  of  the  subatomic  world.  Indeed,  today  no  failure  of  quantum 
mechanics  is  known. 

Nonlinear  equations  are  all  the  rest:  all  those  which  are  not  linear.  Most  of 
the  convenient  properties  of  a  linear  dynamics  mentioned  above  are  lost:  there  is  no 
superposition  and  no  general  solution;  analytic  solutions  are  rare  or  nonexistent;  solutions 
may  have  singularities  not  present  in  the  motion  equations,  and  these  may  depend  on 
the  initial  conditions,  etc.  However,  interesting  new  properties  show  up  in  compensation. 
Asymptotic (time $\to \infty$) solutions are often independent of initial conditions and lie on
low-dimensional  “attractors”  in  phase  space.  There  is  a  complicated  set  of  these  stable 
regimes  joined  by  bifurcations  of  various  types.  There  may  exist  “chaotic”  regimes. 

Incidentally, the reader should not worry if some of the statements in these preliminary
remarks of chapter 1 seem a bit vague or elusive by reason of undefined terms. For
now,  it  is  enough  that  they  carry  some  sort  of  intuitive  meaning.  All  important  terms  and 
concepts  will  be  defined  carefully  at  the  proper  places  in  this  report. 
1.2 The Goals of Nonlinear Dynamics

The  dynamics  that  will  be  our  main  focus  of  attention  in  this  report  will  be  specified 
by  one  or  several  first  order  ordinary  differential  equations  in  time, 

$\dot{x} = f(x,t), \quad \dot{x} \equiv dx/dt,$  (1-1)

or by a map $x \mapsto F(x)$ which is iterated:

$x,\ F(x),\ F[F(x)],\ F\{F[F(x)]\},\ \ldots$  (1-2)

The set of continuous-time solutions, or orbits, of (1-1) is called a flow, while the set
of discrete-time orbits (1-2) is sometimes called a cascade. Both $f$ and $F$ depend on
parameters which can be varied.

What information do we seek in nonlinear dynamics?

a. The geometry, or more often, topology of the flow (or cascade) as a whole: phase
portraits, stable and unstable manifolds, various low-dimensional invariant attracting sets
if they exist.

b. Bifurcation points, that is, those parameter values at which the flow “changes
qualitatively.”

c. The characteristics of “chaotic” flows, and the various paths to chaos which the
dynamics admits.

What mathematical tools are available for this search? There exist theorems, far fewer
than in linear dynamics, which limit the possibilities in nonlinear dynamics. Numerical
computation (sometimes called “experimental mathematics”) plays a big role in discovering
the information listed above and in suggesting and motivating, if not proving, theorems
about a particular dynamics.

1.3 “Chaos”

The  quotation  marks  here  signal  that  a  consensus  has  not  yet  been  reached  on  the  precise 
definition  of  this  term.  This  accounts  for  the  many  apparent  contradictions  and  fruitless 
controversies  in  this  subject.  Following  current  custom,  we  shall  mean  by  “chaos”  any  or 
all  of  the  following  properties:  sensitive  dependence  on  initial  conditions,  broadband  power 
spectra,  decaying  correlations,  or  randomness  or  unpredictability  of  orbits  as  measured  by 
positive  algorithmic  complexity  or  entropies  of  the  various  kinds.  These  properties  are  not 
all  independent. 

Of  these,  sensitive  dependence  (on  initial  conditions  understood),  abbreviated  in  this 
report  as  SD,  is  by  far  the  most  important  ingredient  of  “chaos.”  In  fact,  in  its  strong, 
or  exponential,  form  SD  is  accepted  by  most  as  the  definition  of  chaos.  The  intuitive 
meaning  of  SD  is  the  unpredictability  -  or  uncomputability  -  in  principle  of  some  orbits. 
That  is,  inevitable  errors  in  initial  conditions,  no  matter  how  small,  may  get  magnified  on 
computation,  so  that  the  computed  orbit  (or  some  observable  function  of  the  orbit)  bears 
no resemblance to the actual orbit (or function thereof). This has nothing to do with noise
or  perturbations  from  outside  the  system.  Sensitive  dependence  is  an  intrinsic  property 
of  the  dynamics  in  some  parameter  regimes:  it  is  true  “deterministic  chaos.”  Obviously, 
this  bears  on  the  ancient  philosophical  dichotomy  between  determinism  and  chance  (and 
seems  at  first  sight  to  contradict  it?). 

In  a  system  which  displays  “chaos,”  there  may  be  several  sequences  of  regimes  leading  to 
“chaotic”  behavior,  several  “paths  to  chaos,”  so  to  say.  The  universality  of  these  various 
paths  in  systems  superficially  very  different  (for  example,  iterating  one-dimensional  maps 
and  viscous,  incompressible  fluid  flow)  is  a  surprising  theoretical  and  experimental  result. 

To give the reader a preliminary feeling for sensitive dependence, this perhaps most
important concept of nonlinear dynamics, we shall illustrate it on the simple dynamics
of a 1D (one-dimensional) map. The other attributes of “chaos” mentioned above will
be covered later in the main text. Consider the particular 1D map $F(x) = \mu x(1 - x)$
with $0 \le x \le 1$ and $0 < \mu \le 4$, that is, the iteration scheme $x_{n+1} = \mu x_n(1 - x_n)$,
$n = 0, 1, 2, 3, \ldots$, defining the orbit (1-2). Choose the parameter value $\mu = 4$ and substitute
$x_n = \sin^2 \pi\theta_n$, $0 \le \theta_n < 1$. Then the iteration scheme takes the form
$\sin^2 \pi\theta_{n+1} = 4 \sin^2 \pi\theta_n \cos^2 \pi\theta_n = \sin^2 2\pi\theta_n$, that is,

$\theta_{n+1} = 2\theta_n \pmod 1$  (1-3)

where (mod 1) means that any integral part of $2\theta_n$ is chopped off so that the result lies in
the interval (0,1). We can actually get an “analytic” solution (!) for this parameter value,
namely

$\theta_n = 2^n \theta_0 \pmod 1$,

where $\theta_0 \in (0,1)$ is the initial value. Now shift the initial point slightly: $\theta'_0 = \theta_0 + \epsilon$; then

$\theta'_n - \theta_n = 2^n \epsilon = \epsilon e^{n \ln 2}$

(as long as $2^n \epsilon < 1$), that is, exponential separation of the two initially very close orbits
with Lyapunov exponent $\ln 2$. Obviously, the “error” in the orbit will get big for $n$ large
enough, no matter how small $\epsilon$. This is SD (in particular, exponential SD).
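
The exponential separation can also be watched numerically in the original variable $x$. The following Python sketch is an illustration only and is not part of the original lectures: the function names, the starting value 0.3, and the offset $10^{-12}$ are arbitrary assumptions. It follows two nearby orbits of the logistic map at $\mu = 4$ and estimates the Lyapunov exponent as the orbit average of $\ln|F'(x_n)|$, which for a typical orbit should come out close to $\ln 2 \approx 0.693$.

```python
import math

def logistic(x, mu=4.0):
    """One step of the logistic map F(x) = mu * x * (1 - x)."""
    return mu * x * (1.0 - x)

def separation(x0, eps, n_steps, mu=4.0):
    """Return |x'_n - x_n| for n = 0..n_steps for two orbits started eps apart."""
    x, y = x0, x0 + eps
    out = []
    for _ in range(n_steps + 1):
        out.append(abs(y - x))
        x, y = logistic(x, mu), logistic(y, mu)
    return out

def lyapunov_estimate(x0, n_steps, mu=4.0, n_skip=100):
    """Orbit average of ln|F'(x_n)|, with F'(x) = mu * (1 - 2x)."""
    x = x0
    for _ in range(n_skip):          # discard a short transient
        x = logistic(x, mu)
    total, count = 0.0, 0
    for _ in range(n_steps):
        slope = abs(mu * (1.0 - 2.0 * x))
        if slope > 0.0:              # skip the single point x = 1/2
            total += math.log(slope)
            count += 1
        x = logistic(x, mu)
    return total / count

if __name__ == "__main__":
    d = separation(0.3, 1e-12, 60)
    for n in (0, 10, 20, 30, 40, 50):
        print(n, d[n])
    print("Lyapunov estimate:", lyapunov_estimate(0.3, 100000))
```

The separation grows roughly like $2^n \epsilon$ until it saturates at the size of the interval, after which the two orbits are unrelated.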

To be more quantitative about the SD, write the initial value $\theta_0$ in binary notation. For
example,

$\theta_0 = 1/2 + 1/4 + 1/16 + 1/128 + \cdots = 0.1101001\cdots$.  (1-4)

Then iteration algorithm (1-3) amounts to shifting the “decimal point” to the right by one
and dropping the digit to the left of this point. For the value (1-4),

$\theta_0 = .1101001\cdots$, $\theta_1 = .101001\cdots$, $\theta_2 = .01001\cdots$, $\theta_3 = .1001\cdots$, etc.

We see that $\theta_n$ depends on the $(n+1)$st and higher digits of $\theta_0$, so when $n$ is large, the
value of $\theta_n$ depends extremely sensitively on the precise value of $\theta_0$. For instance, let $\theta_0$ and
$\theta'_0$ differ first in the $(n+1)$st place, where $\theta_0$ has a 0 and $\theta'_0$ has a 1. Then $\theta'_0 - \theta_0 = 2^{-n}$
at most ($\ll 1$ for large $n$). But $\theta_n = .0\cdots$ and $\theta'_n = .1\cdots$, so that they could differ
by as much as 1, or the whole domain (0,1) of the logistic map for $0 < \mu \le 4$. On a
digital computer with capacity $2^N$ bits, the computed orbit for a given $\theta_0$ has in general
no resemblance to the real orbits for times $n > N$.
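
The digit-shift picture can be checked with exact rational arithmetic. The sketch below is an illustration under stated assumptions: the helper names are ours, and the infinite expansion (1-4) is truncated after its first seven binary digits, so that $\theta_0 = 105/128$.

```python
from fractions import Fraction

def doubling(theta):
    """theta -> 2*theta (mod 1); exact when theta is a Fraction."""
    return (2 * theta) % 1

def binary_digits(theta, k):
    """First k binary digits of theta, assumed to lie in [0, 1)."""
    digits = []
    for _ in range(k):
        theta *= 2
        bit = int(theta >= 1)
        digits.append(bit)
        theta -= bit
    return digits

# Truncation of (1-4) to seven binary digits: 0.1101001 (binary)
theta0 = Fraction(1, 2) + Fraction(1, 4) + Fraction(1, 16) + Fraction(1, 128)

if __name__ == "__main__":
    theta = theta0
    for n in range(4):
        print(n, theta, binary_digits(theta, 7 - n))
        theta = doubling(theta)
```

Each application of `doubling` drops the leading digit, so two initial values that first differ in the $(n+1)$st digit are separated by a distance of order one after $n$ steps.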

Ex. 1.1 Take $\theta_0 = 1/7$. Then we know that the exact orbit is

1/7, 2/7, 4/7, 1/7, 2/7, 4/7, 1/7, ...,  (1-5)

that is, a periodic orbit of period 3. Now perform the iteration (1-3) on a pocket calculator
or computer and compare with (1-5) for large $n$.
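
One possible way to carry out this comparison is sketched below in Python; the function name and the choice of 60 iterations are assumptions made for illustration. The exact orbit is computed with rational arithmetic, the approximate one with ordinary double-precision floating point. Since the stored value of 1/7 has a finite binary expansion, the digit shift exhausts it and the floating-point orbit collapses to 0 within about 55 iterations, while the exact orbit keeps its period 3.

```python
from fractions import Fraction

def doubling_orbit(theta0, n_steps):
    """Orbit theta_0, ..., theta_n of the map theta -> 2*theta (mod 1)."""
    orbit = [theta0]
    for _ in range(n_steps):
        orbit.append((2 * orbit[-1]) % 1)
    return orbit

exact = doubling_orbit(Fraction(1, 7), 60)    # exact rational arithmetic
approx = doubling_orbit(1.0 / 7.0, 60)        # IEEE double precision

if __name__ == "__main__":
    for n in (0, 1, 2, 3, 30, 45, 50, 55, 60):
        print(n, exact[n], approx[n])
```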
2. FUNDAMENTALS OF CONTINUOUS TIME SYSTEMS
2.1  Flows 

A system of $N$ first order ordinary differential equations in time $t$,

$\dot{x} = f(x,t), \quad x \in \mathbb{R}^N$  (2-1)

defines a flow. Here we have taken the flow to be in $\mathbb{R}^N$ = the set of all real $N$-tuples
$(x_1, x_2, \cdots x_N)$, which is the usual case; the function $f$, which thus has $N$ components
$(f_1, f_2, \cdots f_N)$, maps $\mathbb{R}^N$ into $\mathbb{R}^N$, in symbols $f: \mathbb{R}^N \to \mathbb{R}^N$. If $f(x,t) = f(x)$
does not explicitly depend on $t$, the flow is called autonomous. $\mathbb{R}^N$, or the subset
of $\mathbb{R}^N$ in which the flow is confined, is called the phase space of the flow. A solution
$x(t) = (x_1(t), x_2(t), \cdots x_N(t))$ of the flow (2-1) with initial value $x_0 = x(0) =
(x_1(0), x_2(0), \cdots x_N(0))$ is called the orbit. A graph of all orbits or some subset of them in
phase space is called a phase portrait, and is useful to visualize the flow as a whole in the
neighborhood of some interesting point or other structure.

Orbits  of  an  autonomous  flow  do  not  intersect!  Every  point  in  phase  space  lies  on  one  and 
only  one  orbit.  This  comes  from  a  beautiful  theorem  on  the  uniqueness  of  orbits,  see,  say, 
GH  (1983,  Th.  1.0.1)  which  states  precisely: 

Let $f$ be $C^1$ in $\mathbb{R}^N$. For any open set $U \subset \mathbb{R}^N$, $\exists$ a time interval
$(-c, c)$ such that the orbit $\phi_t(x_0)$ exists and is unique for every $x_0 \in U$.  (2-2)

For technical mathematical symbols and terms here and hereafter, consult the mathematical
Glossary at the end of the Guckenheimer and Holmes reference above. We shall use
the symbol $\exists$, “there exists,” quite often. We shall usually assume the hypotheses of this
theorem fulfilled for our flows, so that orbits like those shown in Fig. 2.1 are excluded.


Fig.  2.1  Excluded  orbits 
The reader might think that restricting our dynamics to autonomous flows (as we shall
do) is much too narrow. It seems to rule out higher order motion equations, for example,
second order equations such as Newton’s laws deliver, all cases with forcing terms, and
so on. But this is not so. By enlarging our phase space we can include those cases too.
An example will make this clear. Consider the nonautonomous, second order dynamics
defined by $\ddot{x} + x = a \cos \omega t$, a harmonically driven linear harmonic oscillator with position
coordinate $x$. Set $x_1 = x$, $x_2 = \dot{x}$, $x_3 = \omega t$. Then we get

$\dot{x}_1 = x_2, \quad \dot{x}_2 = -x_1 + a \cos x_3, \quad \dot{x}_3 = \omega$.

But this is just an autonomous flow in $\mathbb{R}^3$! In particular, even without the forcing term
($a = 0$), the phase space is $\mathbb{R}^2$, not $\mathbb{R}^1 = \mathbb{R}$; phase space is the space of positions and
velocities (or momenta), so it has dimension $2m$ for a configuration space of dimension $m$.
Hence without loss of generality we shall assume all flows autonomous hereafter.
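
The enlarged autonomous system can be integrated like any other flow. The Python sketch below is a minimal illustration, not a prescribed method: the classical fourth-order Runge-Kutta integrator, the function names, and the parameter values $a = 1$, $\omega = 2$ are our assumptions. With the initial values $x_1(0) = a/(1 - \omega^2)$, $x_2(0) = 0$, $x_3(0) = 0$ the exact solution is the pure forced response $x_1(t) = a \cos \omega t/(1 - \omega^2)$, which gives a simple check.

```python
import math

def driven_oscillator(state, a, omega):
    """Autonomous flow in R^3: x1' = x2, x2' = -x1 + a*cos(x3), x3' = omega."""
    x1, x2, x3 = state
    return (x2, -x1 + a * math.cos(x3), omega)

def rk4_step(f, state, dt, *params):
    """One classical fourth-order Runge-Kutta step for an autonomous flow."""
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))
    k1 = f(state, *params)
    k2 = f(shifted(state, k1, dt / 2), *params)
    k3 = f(shifted(state, k2, dt / 2), *params)
    k4 = f(shifted(state, k3, dt), *params)
    return tuple(s + dt / 6 * (p + 2 * q + 2 * r + w)
                 for s, p, q, r, w in zip(state, k1, k2, k3, k4))

def integrate(f, state, t_end, n_steps, *params):
    """Advance the state from t = 0 to t = t_end in n_steps equal steps."""
    dt = t_end / n_steps
    for _ in range(n_steps):
        state = rk4_step(f, state, dt, *params)
    return state

if __name__ == "__main__":
    a, omega = 1.0, 2.0
    start = (a / (1 - omega**2), 0.0, 0.0)
    x1, x2, x3 = integrate(driven_oscillator, start, 5.0, 5000, a, omega)
    print(x1, a * math.cos(omega * 5.0) / (1 - omega**2), x3)
```

Note that the integrator never sees the time $t$ explicitly; the phase $\omega t$ is carried along as the third coordinate.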

2.2  Linear  Stability  Analysis 
2.2.1  Case  of  Linear  Flows 

Consider the linear autonomous flow $\dot{x} = Ax$, where $A$ is a real $N \times N$ matrix. We
treat the case which usually occurs in applications: $A$ can be diagonalized by a similarity
transformation

$T^{-1} A T = \Lambda$,  (2-3)

where $\Lambda$ is diagonal with the eigenvalues $\lambda_1, \lambda_2, \cdots \lambda_N$ of $A$ on the diagonal. Thus the
corresponding eigenvectors $e_1, e_2, \cdots e_N$,

$A e_i = \lambda_i e_i, \quad i = 1, 2, \cdots N$,  (2-4)

are linearly independent (span $\mathbb{R}^N$), and the columns of $T$ are the components of these
eigenvectors. (We prefer to regard $A$ as a linear operator and the $e_i$ as vectors, basis-independent
concepts; nevertheless, entirely equivalently one can interpret $A$ as an $N \times N$
matrix, the $e_i$ as $N \times 1$, or column, matrices, and $A e_i$ as matrix multiplication.) The
secular equation

$|A - \lambda 1| = \det(A - \lambda 1) = 0$,  (2-5)

where 1 = unit matrix, determines the eigenvalues $\lambda_i$.

The completely general case, when the eigenvectors of $A$ may not span $\mathbb{R}^N$, so that $A$ is not
diagonalizable, is treated in appendix A. $A$ can still be put into a simple, standard form
(Jordan canonical form) by a similarity transformation, and the resulting linear stability
analysis is not essentially different from the diagonalizable case.

We now define some important subspaces of phase space. Divide the eigenvectors into
three subsets,

$\{u_1, u_2, \cdots u_{N_u}\}$ such that $\mathrm{Re}\,\lambda_i > 0$,

$\{v_1, v_2, \cdots v_{N_s}\}$ such that $\mathrm{Re}\,\lambda_i < 0$,  (2-6)

$\{w_1, w_2, \cdots w_{N_c}\}$ such that $\mathrm{Re}\,\lambda_i = 0$,

with $N_u + N_s + N_c = N$. Then define

Unstable subspace $E^u = \mathrm{span}\{u_1, u_2, \cdots u_{N_u}\}$,

Stable subspace $E^s = \mathrm{span}\{v_1, v_2, \cdots v_{N_s}\}$,  (2-7)

Center subspace $E^c = \mathrm{span}\{w_1, w_2, \cdots w_{N_c}\}$.

The reason for the nomenclature is this: we assert that every orbit based at $x_0 \in E^s$ decays
exponentially in $t$; every orbit based at $x_0 \in E^u$ blows up exponentially in $t$; and every
orbit based at $x_0 \in E^c$ is constant in $t$, as $t \to +\infty$. We also claim that each subspace is
invariant (carried into itself) under the flow. Both of these assertions are easily seen by
noting that the solution of $\dot{x} = Ax$ is $x(t) = \exp(tA) x_0$. Taking $x_0 = \sum_j c_j e_j$ in the
stable subspace $E^s$, for example, we see that the orbit is

$\exp(tA) x_0 = \exp(tA) \sum_j c_j e_j = \sum_j c_j \exp(tA) e_j = \sum_j c_j e^{t\lambda_j} e_j \in E^s$,  (2-8)

Q.E.D. Moreover, since $\mathrm{Re}\,\lambda_j < 0$, the length $\|x(t)\| \to 0$. Similarly for $x_0 \in E^u$,
$\|x(t)\| \to +\infty$; for $x_0 \in E^c$, $\|x(t)\| = \|x_0\|$ = const.

A word on the general case: the three subspaces are defined by (2-6) and (2-7), where
the vectors are now generalized eigenvectors to the eigenvalues determined by (2-5). One
can show that these subspaces are invariant and that every orbit based in $E^s$, $E^u$, or $E^c$
decays exponentially, blows up exponentially, or varies algebraically in $t$ as $t \to +\infty$. The
only difference is that powers of $t$ times an exponential in $t$ are in general allowed.

Note that the point $x^* = 0$ (the zero vector) is a fixed point: $\dot{x}^* = 0$ for the linear flow,
in fact the only fixed point. The phase portrait of the flow near the fixed point can be
constructed, and the subspaces $E^u$, $E^s$, and $E^c$ indicated on the same graph.
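
For a $2 \times 2$ matrix the bookkeeping of (2-5) through (2-8) is short enough to sketch in a few lines of Python. The code below is an illustration under our own assumptions (function names, the tolerance used to decide when a real part counts as zero, and the sample matrices, which are not those of the exercises): it solves the secular equation, counts $N_u$, $N_s$, $N_c$, and evaluates the factor $|e^{t\lambda}| = e^{t\,\mathrm{Re}\,\lambda}$ by which an eigenvector component is scaled.

```python
import cmath

def eigenvalues_2x2(A):
    """Roots of the secular equation lambda^2 - tr(A)*lambda + det(A) = 0."""
    (a, b), (c, d) = A
    tr, det = a + d, a * d - b * c
    root = cmath.sqrt(tr * tr - 4.0 * det)
    return ((tr + root) / 2.0, (tr - root) / 2.0)

def subspace_dimensions(eigs, tol=1e-12):
    """Return (N_u, N_s, N_c): eigenvalue counts with Re > 0, Re < 0, Re = 0."""
    n_u = sum(1 for lam in eigs if lam.real > tol)
    n_s = sum(1 for lam in eigs if lam.real < -tol)
    return n_u, n_s, len(eigs) - n_u - n_s

def growth_factor(lam, t):
    """|exp(t*lambda)| = exp(t * Re(lambda)), cf. the last step of (2-8)."""
    return abs(cmath.exp(lam * t))

if __name__ == "__main__":
    samples = {
        "saddle": ((-1.0, 0.0), (0.0, 2.0)),
        "rotation": ((0.0, 1.0), (-1.0, 0.0)),
        "spiral sink": ((-1.0, -1.0), (1.0, -1.0)),
    }
    for name, A in samples.items():
        eigs = eigenvalues_2x2(A)
        print(name, eigs, subspace_dimensions(eigs))
```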

Ex. 2.1 Take A = (°  _*). The phase space is $\mathbb{R}^2$ = the plane.

a. Find the eigenvalues and eigenvectors. Is $A$ diagonalizable?

b. Find $E^u$, $E^s$, and $E^c$.

c. Draw the phase portrait around $x^* = 0 = (0,0)$, and indicate the three
subspaces.

Ex. 2.2 Same question for

$A = \begin{pmatrix} -1 & -1 & 0 \\ 1 & -1 & 0 \\ 0 & 0 & 2 \end{pmatrix}$, phase space is $\mathbb{R}^3$.

As to part c. of Ex. 2.2, here there is a pair of complex conjugate eigenvalues and
eigenvectors: $\lambda_+, e_+$ and $\lambda_- = \lambda_+^*$, $e_- = e_+^*$, where * is complex conjugate. Form two
real vectors $e_1, e_2$ from complex combinations of $e_+$ and $e_-$; then $e_1, e_2$ span the same real
two-dimensional subspace of $\mathbb{R}^3$ as $e_+$ and $e_-$. Express the real orbits in terms of $e_1$ and
$e_2$. You will find spiralling motion.
2.2.2 Case of Nonlinear Flows


The linear stability analysis of the flow $\dot{x} = f(x)$, in general nonlinear, now follows easily
from that of linear flows. Consider a fixed point $x^*$ of the flow, defined by $\dot{x}^* = f(x^*) = 0$.
We linearize the flow about $x^*$. Set $x = x^* + u$, where $\|u\|$ is small in some sense, and
keep only terms of $O(u)$ in the calculation. Substitute $x = x^* + u$ into the flow equations,
expand $f(x^* + u)$ in a power series in $u$ about $x^*$, and keep only the first two terms. For
the $O(u)$ part we get

$\dot{u} = Df(x^*) u$,  (2-9)

where $Df(x^*)$ is the Jacobian matrix evaluated at the fixed point,

$[Df(x^*)]_{ij} = \left. \frac{\partial f_i}{\partial x_j} \right|_{x = x^*}, \quad i,j = 1, 2, \cdots N$.  (2-10)

But (2-9) is just a linear flow with $A = Df(x^*)$. So we find the eigenvalues and eigenvectors,
invariant subspaces $E^u$, $E^s$, $E^c$, etc.; that is, we perform the linear stability analysis for
this fixed point just as in section 2.2.1. We expect the local nonlinear flow around $x^*$ to
be indistinguishable from the linear flow governed by $A = Df(x^*)$. This is true with an
important proviso to be made below.


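We work out an illustrative example: consider the van der Pol oscillator $\ddot{x} + b(x^2 - 1)\dot{x} + x = 0$,
$b > 0$. As an autonomous flow in $\mathbb{R}^2$ it reads $\dot{x}_1 = x_2$, $\dot{x}_2 = -x_1 - b(x_1^2 - 1)x_2$, so $f_1 = x_2$,
$f_2 = -x_1 - b(x_1^2 - 1)x_2$. The only fixed point is $x^* = (0,0)$. The partial derivatives of $f$
are

$\frac{\partial f_1}{\partial x_1} = 0, \quad \frac{\partial f_1}{\partial x_2} = 1, \quad \frac{\partial f_2}{\partial x_1} = -1 - 2b x_1 x_2, \quad \frac{\partial f_2}{\partial x_2} = -b(x_1^2 - 1)$,

so the Jacobian matrix is

$Df(x^* = 0) = \begin{pmatrix} 0 & 1 \\ -1 & b \end{pmatrix}$.  (2-11)

The eigenvalues of this are $\lambda_\pm = b/2 \pm (b^2/4 - 1)^{1/2}$, and the corresponding eigenvectors $e_\pm$
are found to span $\mathbb{R}^2$. Now $\mathrm{Re}\,\lambda_\pm > 0$, so

$E^u = \mathrm{span}\{e_+, e_-\} = \mathbb{R}^2, \quad E^s = E^c = 0$.  (2-12)

All orbits are repelled exponentially from the fixed point 0.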
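
The Jacobian (2-11) and its eigenvalues can be cross-checked numerically. The Python sketch below is only an illustration: the central-difference approximation, the step size, the function names, and the value $b = 0.5$ are our assumptions, not part of the text.

```python
import cmath

def van_der_pol(x1, x2, b):
    """Right-hand side (f1, f2) of the van der Pol flow in the plane."""
    return (x2, -x1 - b * (x1 * x1 - 1.0) * x2)

def jacobian_fd(f, x1, x2, b, h=1e-6):
    """Central-difference Jacobian; row i holds (df_i/dx1, df_i/dx2)."""
    fp1, fm1 = f(x1 + h, x2, b), f(x1 - h, x2, b)
    fp2, fm2 = f(x1, x2 + h, b), f(x1, x2 - h, b)
    return [[(fp1[i] - fm1[i]) / (2 * h), (fp2[i] - fm2[i]) / (2 * h)]
            for i in range(2)]

def eigenvalues_2x2(A):
    """Roots of lambda^2 - tr(A)*lambda + det(A) = 0."""
    (a, b), (c, d) = A
    tr, det = a + d, a * d - b * c
    root = cmath.sqrt(tr * tr - 4.0 * det)
    return ((tr + root) / 2.0, (tr - root) / 2.0)

if __name__ == "__main__":
    b = 0.5
    J = jacobian_fd(van_der_pol, 0.0, 0.0, b)
    print("Jacobian at the origin:", J)
    print("eigenvalues:", eigenvalues_2x2(J))
    print("formula:", b / 2 + cmath.sqrt(b * b / 4 - 1),
          b / 2 - cmath.sqrt(b * b / 4 - 1))
```

For any $b > 0$ both real parts come out positive, in agreement with (2-12).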


2.3  Stability  Types  of  Fixed  Points 


We now have to elucidate the key notion of stability (for a fixed point here, but for more
general structures later), and to see what linear stability analysis has to say about it. For
the fixed point $x^*$ of a general autonomous flow, which we assume is confined to the open
set $U \subset \mathbb{R}^N$, we have the definitions (GH, p.3):

The fixed point $x^*$ is stable if for every neighborhood $V \subset U$
of $x^*$ there is a neighborhood $V_1 \subset V$ of $x^*$ such that every
solution $x(t) = \phi_t(x_0)$ with $x_0 \in V_1$ is defined and $\in V$ for
all $t > 0$.  (2-13a)

The fixed point $x^*$ is asymptotically stable if it is stable and for
every neighborhood $V \subset U$ of $x^*$ a neighborhood $V_1 \subset V$ of $x^*$
exists such that $\phi_t(x_0) \to x^*$, $t \to +\infty$, for every $x_0 \in V_1$.  (2-13b)

The fixed point $x^*$ is unstable if it is not stable.  (2-13c)

See  Fig.  2.2. 


Fig.  2.2  Stability  types  illustrated:  on  the  left,  a  center  (stable  but  not  asymptotically 
stable);  on  the  right,  a  sink  (asymptotically  stable) 

The fixed point has special names in the first two cases: $x^*$ is a center if it is stable but
not asymptotically stable; it is a sink if it is asymptotically stable. It is useful to spell out
the definition of unstable in positive terms by negating the definition of stable. Namely:

The fixed point $x^*$ is unstable if $\exists$ a neighborhood $V \subset U$ of $x^*$
such that for all neighborhoods $V_1 \subset V$ of $x^*$ $\exists$ an orbit based at
$x_0 \in V_1$ which $\notin V$ for some $t > 0$.  (2-14)

Loosely transcribed, these careful definitions can be phrased as follows. A fixed point $x^*$
is stable if for every neighborhood $V$ of $x^*$, we can keep all orbits in $V$ forever if we start
them close enough to $x^*$. It is asymptotically stable if it is stable and if all orbits converge
to $x^*$ if we start them close enough to $x^*$. It is unstable if we can find a neighborhood $V$
of it such that from any neighborhood $V_1$ of $x^*$ contained in $V$, no matter how small, at
least one orbit started in $V_1$ escapes from $V$ at some time.

To  convince  yourself  of  the  necessity  of  the  details  which  seem  to  make  the  careful 
definitions  (2-13)  unnecessarily  pedantic  and  complicated,  try  answering  the  questions 
in  Ex.  2.3. 

Ex.  2.3  Do  the  following  statements  say  anything  about  the  stability  type  of  x*? 

a.  For  every  neighborhood  V  of  x*  an  orbit  based  in  V  converging  to  x*  is 
observed. 

b.  For  every  neighborhood  V  of  x*  an  orbit  based  in  V  which  escapes  V  is 
observed. 

c. We can find a neighborhood $V$ of $x^*$ such that for every neighborhood $V_1 \subset V$
of $x^*$, every orbit based in $V_1$ stays in $V$ for all $t > 0$.

2.4  Connection  of  Stability  and  Linear  Stability  Analysis 

The general idea is that the local behavior of the linearized flow around $x^*$ carries over to
the nonlinear flow, thus selecting one of its stability types. To make this precise, let us
call $x^*$ hyperbolic (sometimes nondegenerate) if $Df(x^*)$ has no eigenvalue $\lambda$ with $\mathrm{Re}\,\lambda = 0$.
That is, if $E^c = 0$. If then $E^s = 0$, $x^*$ is called a source; if both $E^s$ and $E^u \ne 0$, $x^*$ is
called a saddle point. See Fig. 2.3.

If $x^*$ is hyperbolic, then there exists a homeomorphism $h$ in some neighborhood $V$ of $x^*$ taking the orbits
$\phi_t(x_0)$ of the nonlinear flow into the orbits $\exp(tDf(x^*))u_0$ of the linearized flow and
preserving the sense of the orbits (GH, Th 1.3.1).

Def. A homeomorphism $h: U \to V$ maps open set $U$ into open
set $V$ such that $h$ is continuous and $h^{-1}$ exists and is continuous.  (2-15)


See  Fig.  2.4. 

Homeomorphism is the basic concept of topology. A topological property is one invariant
under all homeomorphisms (thus under stretching, compressing, twisting, etc., but not
tearing). The importance of homeomorphisms and topology in nonlinear dynamics is that
stability is a topological notion!

Thus in the hyperbolic case the linearization determines the asymptotic time behavior of
the nonlinear flow and hence $x^*$'s stability type. We have

$x^*$ is asymptotically stable iff only $E^s \ne 0$ ($x^*$ a sink),
$x^*$ is unstable iff $E^u \ne 0$ ($x^*$ is a source or a saddle point).

(Hyperbolic case)  (2-16)

Iff = “if and only if.” The center case is absent. These are the only two possibilities. If
$\mathrm{Re}\,\lambda = 0$ for any eigenvalue of the Jacobian matrix ($E^c \ne 0$), the stability of $x^*$ cannot in
general be determined by the linearization.
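
The rule (2-16) translates directly into a small decision procedure on the eigenvalues of the Jacobian matrix. The Python sketch below is an illustration only; the function name, the returned labels, and the tolerance deciding when a real part counts as zero are our assumptions.

```python
def classify_fixed_point(eigs, tol=1e-12):
    """Stability type of a fixed point from the eigenvalues of Df(x*).

    Implements (2-16) for hyperbolic fixed points and reports the case
    Re(lambda) = 0, where the linearization alone does not decide.
    """
    reals = [complex(lam).real for lam in eigs]
    if any(abs(r) <= tol for r in reals):
        return "non-hyperbolic (linearization inconclusive)"
    if all(r < 0 for r in reals):
        return "sink (asymptotically stable)"
    if all(r > 0 for r in reals):
        return "source (unstable)"
    return "saddle point (unstable)"

if __name__ == "__main__":
    print(classify_fixed_point([-1 + 1j, -1 - 1j, -3]))
    print(classify_fixed_point([0.25 + 0.97j, 0.25 - 0.97j]))
    print(classify_fixed_point([-1, 2]))
    print(classify_fixed_point([1j, -1j]))
```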


Fig.  2.4  Homeomorphism  from  the  nonlinear  to  the  linearized  flow  around  a  fixed  point. 
2.5 Topological Equivalence of Flows

The  above  section  motivates  the  introduction  of  another  topological  notion  which  will  be 
useful  when  we  come  to  discuss  bifurcations.  First  we  give  the  rigorous  definition,  then 
try to give the reader an intuitive feeling for it by some remarks. Consider two flows,
$\dot{x} = f(x)$, $x \in X$, and $\dot{y} = g(y)$, $y \in Y$, with orbit functions $\phi_t^f$ and $\phi_t^g$, respectively.

Def. The $f$-flow and the $g$-flow are topologically equivalent iff
there exists a homeomorphism $h: X \to Y$ such that for every $t_1$,
$h \circ \phi_{t_1}^f = \phi_{t_2}^g \circ h$ for some $t_2$.  (2-17)

Here $\circ$ is functional composition, namely $f \circ g(x) = f(g(x))$. If the homeomorphism is
realized by the function $y = h(x)$, this means

$h(\phi_{t_1}^f(x)) = \phi_{t_2}^g(h(x))$, all $x \in X$.  (2-18)

That is, there exists a 1-1 bicontinuous map which takes every point $x$ on an orbit of the
$f$-flow at time $t_1$ into a point $y = h(x)$ of the orbit of the $g$-flow at a time $t_2$. See Fig. 2.5.

More briefly, $h$ maps every orbit of the $f$-flow into an orbit of the $g$-flow in a continuous
manner, and vice-versa. Stated yet another way, the $f$-flow can be continuously deformed
into the $g$-flow, and vice-versa. Thus the two flows are really the same up to a “change of
coordinates,” as (2-17) validates. In this language the theorem in section 2.4 says that the
nonlinear flow is topologically equivalent to the linearized flow around a hyperbolic fixed
point.
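
A concrete instance of (2-18) may help. The example below is ours, not taken from the text: the two 1D flows $\dot{x} = -x$ and $\dot{y} = -2y$ have orbit functions $x e^{-t}$ and $y e^{-2t}$, and the map $h(x) = \mathrm{sign}(x)\,x^2$ is a homeomorphism of the line (continuous, with continuous inverse $\mathrm{sign}(y)\sqrt{|y|}$) that carries one into the other with $t_2 = t_1$. The Python sketch checks the relation at a few sample points; all names are assumptions.

```python
import math

def phi_f(t, x):
    """Orbit function of the flow x' = -x."""
    return x * math.exp(-t)

def phi_g(t, y):
    """Orbit function of the flow y' = -2y."""
    return y * math.exp(-2.0 * t)

def h(x):
    """Candidate homeomorphism h(x) = sign(x) * x^2."""
    return math.copysign(x * x, x)

def h_inv(y):
    """Inverse map sign(y) * sqrt(|y|)."""
    return math.copysign(math.sqrt(abs(y)), y)

def max_conjugacy_error(points, times):
    """Largest value of |h(phi_f(t, x)) - phi_g(t, h(x))| over the samples."""
    return max(abs(h(phi_f(t, x)) - phi_g(t, h(x)))
               for x in points for t in times)

if __name__ == "__main__":
    print(max_conjugacy_error([-2.0, -0.5, 0.0, 0.3, 1.7], [0.0, 0.5, 3.0]))
```

Note that $h$ is not differentiable with differentiable inverse at the origin; topological equivalence asks only for continuity, which is why flows with different decay rates can still be equivalent.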


Fig.  2.5  Mapping  of  topologically  equivalent  flows 
2.6 Stable and Unstable Manifolds of a Fixed Point

These  are  generalizations  of  the  stable  and  unstable  subspaces  of  the  linearized  flow  to  the 
full  nonlinear  flow.  We  shall  give  loosely  stated  definitions  sufficient  for  our  purposes. 

The (global) stable manifold $W^s$ of a hyperbolic fixed point $x^*$ is the set of all those
points which converge to $x^*$ under the flow. That is, all $x \in$ phase space such that
$\phi_t(x) \to x^*$, $t \to +\infty$. Similarly, the (global) unstable manifold $W^u$ of $x^*$ is the set of
all those points which “diverge from $x^*$ under the flow” or, more precisely, all those points
which converge to $x^*$ as time runs backwards: $\phi_t(x) \to x^*$, $t \to -\infty$. Both $W^s$ and $W^u$
are invariant sets, where the general definition is: set $S$ is invariant if $\phi_t(S) \subset S$,
$-\infty < t < \infty$. See Fig. 2.6, which illustrates $W^s$ and $W^u$, both 1D (one-dimensional) for a 2D
flow, and also their relations to $E^s$ and $E^u$.

Then we have the Stable Manifold Theorem (GH, Th 1.3.2), which states that $W^s$ and
$W^u$ have the same dimensions as $E^s$ and $E^u$, respectively, and are tangent to them at the
hyperbolic fixed point $x^*$. Figure 2.6 illustrates this.

We detail some properties of $W^s$ and $W^u$ in the following. Since an orbit is an invariant
set and orbits do not intersect, we see that $W^s$ and $W^u$ are composed of entire orbits.
Clearly, given two (hyperbolic understood) fixed points $x^*$, $y^*$, $W^s(x^*)$ and $W^s(y^*)$ cannot
intersect, by definition. The same goes for $W^u(x^*)$ and $W^u(y^*)$.

But $W^s$ and $W^u$ of two different fixed points or of the same fixed point may intersect
without contradiction. These intersections must of course be composed of whole orbits
common to both $W^s$ and $W^u$. Figure 2.7 illustrates some of these properties.
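
A simple planar example, chosen by us for illustration and not taken from the text, makes the tangency statement concrete. For the flow $\dot{x} = x$, $\dot{y} = -y + x^2$ the origin is a saddle with $E^s$ the $y$-axis and $E^u$ the $x$-axis. The stable manifold $W^s$ is the $y$-axis itself, while the unstable manifold is the parabola $y = x^2/3$, which is tangent to $E^u$ at the origin. The Python sketch below (the integrator, names, and step sizes are assumptions) follows one orbit on each manifold.

```python
import math

def flow(state):
    """Right-hand side of x' = x, y' = -y + x^2."""
    x, y = state
    return (x, -y + x * x)

def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step (dt may be negative)."""
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))
    k1 = f(state)
    k2 = f(shifted(state, k1, dt / 2))
    k3 = f(shifted(state, k2, dt / 2))
    k4 = f(shifted(state, k3, dt))
    return tuple(s + dt / 6 * (p + 2 * q + 2 * r + w)
                 for s, p, q, r, w in zip(state, k1, k2, k3, k4))

def integrate(f, state, t_end, n_steps):
    """Advance from t = 0 to t = t_end; negative t_end runs time backwards."""
    dt = t_end / n_steps
    for _ in range(n_steps):
        state = rk4_step(f, state, dt)
    return state

if __name__ == "__main__":
    # A point on the parabola y = x^2/3, followed backwards in time.
    xb, yb = integrate(flow, (1.0, 1.0 / 3.0), -5.0, 5000)
    print("backward:", xb, yb, xb * xb / 3.0)
    # A point on the y-axis, followed forwards in time.
    xf, yf = integrate(flow, (0.0, 1.0), 5.0, 5000)
    print("forward:", xf, yf)
```

The backward orbit stays on the parabola while approaching the origin, and the forward orbit stays on the $y$-axis while approaching the origin, as the definitions of $W^u$ and $W^s$ require.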

2.7  Bifurcations 


In  section  1.2,  we  described  a  bifurcation  as  a  “qualitative  change  in  the  flow.”  But  what 
does  this  mean  precisely?  If  the  orbits  are  all  straight  lines  and,  by  changing  a  parameter, 
suddenly  become  wavy  lines,  is  this  a  bifurcation?  Fortunately,  we  now  have  the  topological 
tools  at  hand  to  pin  this  notion  down  precisely.  But  first  we  motivate  the  general  definition 
by  treating  local  bifurcations  near  fixed  points. 

2.7.1  Local  Bifurcations  Near  Fixed  Points


Consider a 1D real flow $\dot{x} = f_\mu(x)$, $x \in \mathbb{R}$, where $\mu$ is a parameter. Let $x^*$ be a fixed point,
and consider the linearized motion (2-9). The Jacobian matrix in this case is the ordinary
derivative:

$$Df_\mu(x^*) = \left.\frac{df_\mu(x)}{dx}\right|_{x=x^*} = \lambda, \tag{2-19}$$


the eigenvalue itself. Thus in the case $\lambda \neq 0$ (note $\lambda$ is real) we have the hyperbolic case,
and the stability type of $x^*$ is completely determined by the principle (2-16). Namely, $x^*$
is asymptotically stable if $\lambda < 0$ and unstable if $\lambda > 0$.

Fig. 2.6 Relation of $W^s$ and $E^s$ and of $W^u$, $E^u$ of a fixed point.

Fig. 2.7 Top: forbidden transverse intersection of stable manifolds (“this is out!”). Bottom: allowed
cases of stable and unstable manifolds (panels: homoclinic orbit; heteroclinic orbit; a saddle point
with five different non-intersecting orbits).


a. 1D laser equation

$$\dot{x} = \alpha x - \kappa x^2, \quad \kappa > 0 \text{ fixed}, \quad \alpha \gtrless 0 \text{ variable}. \tag{2-20}$$

$x$ has the interpretation of the number of laser photons, or intensity, so phase space is
$0 \le x < \infty$. The fixed points are $x_1^* = 0$, $x_2^* = \alpha/\kappa$. $\lambda = \alpha - 2\kappa x^*$, so $\lambda_1 = \alpha$, $\lambda_2 = -\alpha$.
Thus for $\alpha < 0$, $x_1^*$ is asymptotically stable, $x_2^*$ is unstable. For $\alpha > 0$, $x_1^*$ is unstable,
$x_2^*$ is asymptotically stable. The value $\alpha = 0$ is the bifurcation value of the parameter,
separating these two different stability regimes. This is displayed in a bifurcation diagram,
which plots the fixed points against the parameter and indicates their stability. See Fig. 2.8.


Fig. 2.8 Bifurcation diagram for the 1D laser (branches $x_1^* = 0$ and $x_2^* = \alpha/\kappa$ plotted against $\alpha$, with stable and unstable segments marked).


b. Pitchfork bifurcation

$$\dot{x} = \mu x - x^3, \quad \mu \gtrless 0 \text{ variable}. \tag{2-21}$$

Here $-\infty < x < \infty$. The fixed points are $x_1^* = 0$, $x_\pm^* = \pm\sqrt{\mu}$. $\lambda = \mu - 3(x^*)^2$, so $\lambda_1 = \mu$,
$\lambda_\pm = -2\mu$. Hence for $\mu < 0$, $x_1^*$ is asymptotically stable and the fixed points $x_\pm^*$ don't
exist (since phase space is real). For $\mu > 0$, $x_1^*$ is unstable, $x_\pm^*$ are asymptotically stable.
We see that $\mu = 0$ is the bifurcation value, and the bifurcation diagram has the form in
Fig. 2.9.

The linearized motion in the 1D case is $u = e^{\lambda t} u_0$, so the phase portraits are as shown in
Fig. 2.10 in the different stability regimes.

Fig. 2.9 Pitchfork bifurcation diagram

Fig. 2.10 Phase portraits for the 1D flow (2-21)
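The eigenvalue test used in examples a and b can be checked numerically. The following is a minimal sketch, not part of the original lectures; the function names are hypothetical, and the fixed points and eigenvalues are simply the closed-form expressions quoted for (2-20) and (2-21).

```python
import math


def classify_fixed_point(lam):
    """Stability type of a fixed point of a 1D flow from its eigenvalue."""
    if lam < 0:
        return "asymptotically stable"
    if lam > 0:
        return "unstable"
    return "nonhyperbolic (linearization inconclusive)"


def laser_fixed_points(alpha, kappa):
    """Fixed points of (2-20) paired with lambda = alpha - 2*kappa*x."""
    points = [0.0, alpha / kappa]
    return [(x, alpha - 2.0 * kappa * x) for x in points]


def pitchfork_fixed_points(mu):
    """Real fixed points of (2-21) paired with lambda = mu - 3*x**2."""
    points = [0.0]
    if mu > 0:
        root = math.sqrt(mu)
        points += [root, -root]
    return [(x, mu - 3.0 * x * x) for x in points]


if __name__ == "__main__":
    for mu in (-1.0, 1.0):
        for x, lam in pitchfork_fixed_points(mu):
            print(f"mu={mu:+.1f}  x*={x:+.3f}  lambda={lam:+.3f}  "
                  f"{classify_fixed_point(lam)}")
```

Scanning the parameter through zero and recording the classification of each branch reproduces the content of the bifurcation diagrams in Figs. 2.8 and 2.9.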

c. The Brusselator (a model chemical reaction)

$$\dot{X} = A - (B+1)X + X^2 Y, \qquad \dot{Y} = BX - X^2 Y, \tag{2-22}$$

where the phase space point $x = (X, Y)$, $X, Y > 0$, is in the first quadrant of the plane.
The sole fixed point is $x^* = (X^*, Y^*) = (A, BA^{-1})$. The Jacobian matrix there is

$$Df(x^*) = \begin{pmatrix} B-1 & A^2 \\ -B & -A^2 \end{pmatrix},$$

hence the eigenvalues are $\lambda_\pm = -\Delta/2 \pm \sqrt{\Delta^2/4 - A^2}$, where $\Delta = A^2 + 1 - B$. We take
$\Delta$ as the variable parameter and hold $A \neq 0$ fixed. We see that for $\Delta > 0$, $\mathrm{Re}\,\lambda_\pm < 0$; for
$\Delta < 0$, $\mathrm{Re}\,\lambda_\pm > 0$. Hence we have the hyperbolic case; and $x^*$ is asymptotically stable for
$\Delta > 0$ ($B < A^2 + 1$), $x^*$ is unstable for $\Delta < 0$ ($B > A^2 + 1$). $\Delta = 0$ is the bifurcation
value. We have a new situation here. For $B < A^2 + 1$ we have a single sink (or attractive
fixed point). For $B > A^2 + 1$ this becomes unstable, a source (or repelling fixed point).
What happens to the flow then, since there are no further fixed points to converge to? The
answer is that the flow converges again, but to a one-dimensional set, a limit cycle, that
is, a periodic orbit. See the phase portraits in Fig. 2.11.
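The sign change of $\mathrm{Re}\,\lambda_\pm$ at $\Delta = 0$ can be verified numerically. The sketch below is illustrative only: it assumes NumPy is available, the function names are hypothetical, and the Jacobian is the one obtained by differentiating (2-22) at the fixed point $(A, B/A)$.

```python
import cmath

import numpy as np


def brusselator_jacobian(A, B):
    """Jacobian of (2-22) evaluated at the fixed point (A, B/A)."""
    return np.array([[B - 1.0, A ** 2],
                     [-B, -(A ** 2)]])


def brusselator_eigenvalues(A, B):
    """Closed form: lambda = -Delta/2 +/- sqrt(Delta^2/4 - A^2)."""
    delta = A ** 2 + 1.0 - B
    root = cmath.sqrt(complex(delta ** 2 / 4.0 - A ** 2))
    return (-delta / 2.0 + root, -delta / 2.0 - root)


if __name__ == "__main__":
    A = 1.0
    for B in (1.5, 3.0):  # Delta = +0.5 (sink) and Delta = -1.0 (source)
        numeric = np.linalg.eigvals(brusselator_jacobian(A, B))
        print(f"B={B}: Delta={A**2 + 1 - B:+.2f}, eigenvalues={numeric}")
```

With $A = 1$ the bifurcation value is $B = 2$; the two sample values of $B$ sit on either side of it.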


Motivated by these examples, we generalize to the definition: a local bifurcation at the fixed
point $x^*$ of the flow $f_\mu(x)$ occurs when an eigenvalue $\lambda$ of $Df_\mu(x^*)$ crosses the imaginary
axis in the complex eigenvalue plane. Let it cross at the bifurcation value $\mu_b$ of the
parameter. Then $\mathrm{Re}\,\lambda(\mu_b) = 0$, and this zero is isolated.

2.7.2  Bifurcation.  General  Definition 

Look at the phase portraits in Figs. 2.10 and 2.11. The flows are really “qualitatively
different” before and after the bifurcation. What we mean is that there is no way we can
continuously deform one flow into the other. This is the key idea, bringing in the notion
(2-17) of topological equivalence.

Def. The parameter value $\mu_b$ of the flow $\dot{x} = f_\mu(x)$ is called a
bifurcation value if the flow is not structurally stable at $\mu = \mu_b$.
This means that there exist arbitrarily small perturbations $\delta_1 f(x)$,
$\delta_2 f(x)$ of $f_{\mu_b}(x)$ such that the perturbed flow $\dot{x} = f_{\mu_b}(x) + \delta_1 f(x)$ is
not topologically equivalent to the perturbed flow $\dot{x} = f_{\mu_b}(x) + \delta_2 f(x)$.  (2-23)

This is still not completely unambiguous, since the nature of these perturbations must
be spelled out (see GH, definitions 1.7.1 and 1.7.4), but will suffice for us. Clearly this
is what happened in the local bifurcations of section 2.7.1. The small perturbations $\delta_1 f$
and $\delta_2 f$ were generated by changing the parameter $\mu$ itself to values slightly above and
slightly below the bifurcation value $\mu_b$. Later we shall meet global bifurcations, whose
modification of the flow is not confined to the neighborhood of any point in phase space,
that is, “extends to infinity.”

The notion of structural stability of a flow is a very useful one in nonlinear dynamics. For
example, if we know that a flow is structurally stable at a value $\mu$, we can change $\mu$ slightly
and be sure that the phase portrait is roughly the same, that is, it has not suffered any catastrophic,
qualitative changes.

Ex. 2.4 Find fixed points, their stability, and bifurcation values (if any) for the
following real 1D flows. Draw the bifurcation diagrams. The real parameter $\mu$ can be
positive, negative, or zero.

a) $f_\mu(x) = \mu - x^2$

b) $f_\mu(x) = \mu x - x^2$

c) $f_\mu(x) = \mu^2 x - x^3$

d) $f_\mu(x) = \mu^2 x + x^3$

2.8  Dissipative  Flows  and  Attractors 

Consider an autonomous flow $\dot{x} = f(x)$ in $\mathbb{R}^N$. The system is called dissipative in $U \subset \mathbb{R}^N$
if $\nabla \cdot f = \sum_i \partial f_i / \partial x_i < 0$ there. For $f(x)$ is the velocity field of the flow in phase space, so that
negative divergence implies that comoving volume elements are shrinking in time. Hence
a nonzero volume of $\mathbb{R}^N$ asymptotically shrinks to volume 0 under the flow. This fact,
that flows which initially occupy high-dimensional manifolds eventually end up on very
low-dimensional manifolds for dissipative flows, is one of the characteristic and simplifying
properties of these nonlinear flows. Such final sets are called attracting sets or attractors.
We can make this precise by

Def. A closed and invariant set $A$ is called an attracting set
if $\exists$ a neighborhood $V$ of $A$ such that $\phi_t(x) \in V$ for $t \ge 0$
and $\phi_t(x) \to A$, $t \to +\infty$, for all $x \in V$.  (2-24)

One can strengthen the definition by requiring some extra properties such as indecomposability
($A$ contains a dense orbit), generalized dimension in some range, “chaotic” flow,
etc., and call such sets attractors or strange attractors. We shall not try to be too precise
here because there is not yet universal agreement on these definitions in the literature.
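The link between negative divergence and shrinking volume, used above to motivate attracting sets, can be written out explicitly. The identity below is the standard volume-transport formula, quoted here as a reminder rather than derived in this report; $\Omega(t)$ denotes a region carried along by the flow and $V(t)$ its volume, both symbols introduced only for this illustration.

```latex
\frac{dV}{dt} = \int_{\Omega(t)} \nabla \cdot f \; d^N x .
% Special case: the linear flow \dot{x} = Ax has \nabla \cdot f = \operatorname{tr} A,
% a constant, so every volume changes by the same exponential factor:
V(t) = V(0)\, e^{\,t \operatorname{tr} A}.
```

In the linear case, $\operatorname{tr} A < 0$ therefore means that every initial volume shrinks to zero exponentially, even if some individual directions are expanding.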

Examples of attractors are attractive fixed points and limit cycles. Remember the
definition of the former in (2-13b). The notion of the convergence of an orbit to a set
$A$ such as a limit cycle, which occurs in the definition (2-24), is no more complicated than
the usual “$\epsilon$, $\delta$” definition of convergence to a point; one must only use the distance of the
orbit point $\phi_t(x)$ from the (closed) set $A$ rather than the more familiar distance from a
point $x^*$.

Such attractors have “volumes” (that is, Lebesgue measure in $\mathbb{R}^N$) $= 0$, and thus ordinary
dimension an integer $\le N - 1$, when they are not too pathological to be assigned an ordinary
dimension. For example, an attracting fixed point has dimension 0; an attracting limit
cycle, dimension 1. But generalized, nonintegral dimension, such as Hausdorff dimension
HD, may be assigned to any point set in $\mathbb{R}^N$. Sets with zero volume may have HD any real
number between 0 and $N$. For details, see for example, Young (1983). In fact, requiring
$N - 1 < \mathrm{HD}(A) < N$ is a popular criterion that the attractor $A$ be strange.

As we saw in section 2.1, orbits do not intersect. This must naturally have some
consequences in the cramped phase spaces of one or two dimensions. Such theorems
are generally called “no-pass” theorems. They are too technical to quote in this report;
the reader is referred to GH, section 1.8, or to Wiggins (1990, section 1.1.1) for details.
But the general moral of these theorems is that 1D and 2D flows cannot be “chaotic.” It
is generally agreed that phase space dimension $N \ge 3$ is necessary for “chaos.” We remind
the reader that “chaos” is to be understood as explained in chapter 1; a precise definition
will be suggested later.




3.  DISCRETE  TIME  SYSTEMS:  ITERATION  OF  MAPS 

3.1  Iteration  of  Maps 

Consider a function, or map, $F$ mapping $\mathbb{R}^N$ into $\mathbb{R}^N$ which is continuous with continuous
first derivatives. We write $F: \mathbb{R}^N \to \mathbb{R}^N$, $F \in C^1$. It determines a dynamics in which
“time” is discrete by iteration, namely

$$x_{n+1} = F(x_n), \quad n = \text{integer}. \tag{3-1}$$

We shall occasionally call the map dynamics (3-1) a cascade. Thus the $n$th point of the
orbit in terms of the initial point $x_0$ is

$$x_n = F^n(x_0), \quad \text{where } F^n(x_0) = \underbrace{F \circ F \circ F \circ \cdots \circ F}_{n \text{ times}}(x_0) = F(F(F(\cdots F(x_0) \cdots))), \quad F^0(x_0) = x_0. \tag{3-2}$$

The symbol $F \circ G$ denotes functional composition $F \circ G(x) = F(G(x))$ (don't confuse $F^n$,
the $n$th iterate, with the $n$th power).

Note that $F^{n\prime}(x_0) = F'(x_{n-1}) F^{n-1\,\prime}(x_0)$, where prime means derivative, by the chain rule
of differentiation, if the phase space is $\mathbb{R}$. By induction,

$$F^{n\prime}(x_0) = F'(x_{n-1}) F'(x_{n-2}) \cdots F'(x_0) \quad \text{(Chain Rule)}. \tag{3-3}$$

In the case of $\mathbb{R}^N$, replace $F' = dF/dx$ and $F^{n\prime}$ by $DF$ and $DF^n$, the Jacobian matrices
(2-10), and (3-3) implies matrix multiplication.
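The iteration (3-2) and the chain rule (3-3) translate directly into a few lines of code. The sketch below is only an illustration: the helper names are hypothetical, and the logistic map with $\mu = 3.2$ is used merely as a convenient 1D test map. It compares the product of derivatives along the orbit with a centered finite-difference estimate of the derivative of $F^n$.

```python
def logistic(x, mu):
    """Example 1D map F(x) = mu * x * (1 - x)."""
    return mu * x * (1.0 - x)


def logistic_prime(x, mu):
    """Derivative F'(x) = mu * (1 - 2x)."""
    return mu * (1.0 - 2.0 * x)


def orbit(F, x0, n):
    """Return [x_0, x_1, ..., x_n] with x_{k+1} = F(x_k), as in (3-1)."""
    xs = [x0]
    for _ in range(n):
        xs.append(F(xs[-1]))
    return xs


def iterate_derivative(F_prime, xs):
    """Derivative of F^n at x_0 via (3-3): product of F'(x_k), k = 0..n-1."""
    product = 1.0
    for x in xs[:-1]:
        product *= F_prime(x)
    return product


mu, x0, n, h = 3.2, 0.3, 5, 1e-7
F = lambda x: logistic(x, mu)
xs = orbit(F, x0, n)
chain = iterate_derivative(lambda x: logistic_prime(x, mu), xs)
# Centered finite difference of F^n at x0, for comparison.
finite_diff = (orbit(F, x0 + h, n)[-1] - orbit(F, x0 - h, n)[-1]) / (2.0 * h)
print(chain, finite_diff)
```

The two printed numbers should agree to several digits; the product form is the one that generalizes to matrix products of Jacobians in $\mathbb{R}^N$.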

Again, orbits do not intersect under certain lenient conditions, cf. the theorem (2-2) for
flows. If $F$ has an inverse $F^{-1}$ which is $C^1$, the orbit based at $x_0$ is the set (3-1) for
$-\infty < n < \infty$; if $F$ is, however, noninvertible, (3-1) with $n = 0, 1, 2, \cdots$ is the orbit. In
(3-2) and (3-3) $n$ was tacitly taken positive.

3.2  Linear  Stability  Analysis 
3.2.1  Linear  Maps 

If $F(x) = Ax$, $A$ = $N \times N$ real matrix, the map is linear. If we can diagonalize $A$, we
define the unstable, stable, and center subspaces by dividing the eigenvectors into the three
subsets

$$\begin{aligned}
&\{u_1, u_2, \cdots, u_{N_u}\} \quad \text{such that } |\lambda_i| > 1, \\
&\{v_1, v_2, \cdots, v_{N_s}\} \quad \text{such that } |\lambda_i| < 1, \\
&\{w_1, w_2, \cdots, w_{N_c}\} \quad \text{such that } |\lambda_i| = 1,
\end{aligned} \tag{3-4}$$

with $N_u + N_s + N_c = N$. The subspaces $E^u$, $E^s$, $E^c$ spanned by these subsets are invariant, and vectors based in them blow up,
decay, or vary “algebraically” in length respectively as $n \to +\infty$. For now the explicit
orbit solution is $x_n = A^n x_0$. Hence for $x_0 = \sum_j c_j e_j$ belonging to one of these subspaces,

$$x_n = A^n \sum_j c_j e_j = \sum_j c_j \lambda_j^n e_j, \tag{3-5}$$

from which both assertions follow.

Again,  the  general  case  A  not  diagonalizable  is  not  essentially  different.  See  appendix  A. 
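The classification (3-4) depends only on the moduli of the eigenvalues, so it is easy to automate. The following sketch assumes NumPy; the function name and the tolerance used to decide $|\lambda| = 1$ are illustrative choices, not part of the original text, and the two sample matrices are arbitrary examples rather than those of the exercises.

```python
import numpy as np


def classify_linear_map(A, tol=1e-9):
    """Count eigenvalues of A with |lambda| > 1, < 1, and = 1 (within tol)."""
    counts = {"unstable": 0, "stable": 0, "center": 0}
    for lam in np.linalg.eigvals(A):
        modulus = abs(lam)
        if modulus > 1.0 + tol:
            counts["unstable"] += 1
        elif modulus < 1.0 - tol:
            counts["stable"] += 1
        else:
            counts["center"] += 1
    return counts


if __name__ == "__main__":
    saddle = np.diag([2.0, 0.5])                    # one expanding, one contracting direction
    rotation = np.array([[0.0, -1.0], [1.0, 0.0]])  # eigenvalues +i and -i
    print(classify_linear_map(saddle))
    print(classify_linear_map(rotation))
    # Explicit orbit x_n = A^n x_0 for the saddle, n = 3.
    print(np.linalg.matrix_power(saddle, 3) @ np.array([1.0, 1.0]))
```

For the diagonal example the orbit component along the first eigenvector doubles at every step while the other halves, which is exactly the content of (3-5).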

Exs. 3.1 and 3.2. Same as Exs. 2.1 and 2.2, but let the given matrices $A$ define
linear map dynamics.

3.2.2  Nonlinear  Maps 

Again, we can use the linear stability analysis for linear maps by linearizing about a fixed
point $x^*$ of the map $F$: $F(x^*) = x^*$. Set $x_0 = x^* + u$, $\|u\|$ small, and $x_n = x^* + u_n$.
Substituting into (3-2), expanding, and keeping only terms up to $O(u)$, we get

$$x_n = F^n(x^* + u) \approx F^n(x^*) + DF^n(x^*)u = x^* + [DF(x^*)]^n u \;\Rightarrow\; u_n = [DF(x^*)]^n u. \tag{3-6}$$

We used $F^n(x^*) = x^*$ and $DF^n(x^*) = [DF(x^*)]^n$ from the chain rule (3-3), since $x_0 =
x_1 = x_2 = \cdots = x_{n-1} = x^*$ for a fixed point. But (3-6) is just the orbit for the linear map
with $A = DF(x^*)$. So the linear stability analysis for the fixed point is performed just as
in section 3.2.1.

3.3  Stability  Types  of  a  Fixed  Point 

These are defined just as in section 2.3 with the changes $\phi_t(x_0) \to F^n(x_0)$, $t \to n$.

3.4  Connection  of  Stability  and  Linear  Stability  Analysis 

We now call a fixed point of a map $F$ hyperbolic if $DF(x^*)$ has no eigenvalue $\lambda$ with
$|\lambda| = 1$, or if $E^c = 0$. Sources and saddle points are defined by the same language as in
section 2.4 where now, of course, the invariant subspaces are identified by (3-4). For $x^*$
hyperbolic there exists a homeomorphism mapping the orbits of the nonlinear map onto
those of the linearized map in some neighborhood of $x^*$ and preserving their sense. Thus
in the hyperbolic case the linearization determines the stability type, and we get the same
two cases: $x^*$ is asymptotically stable (a sink) iff only $E^s \neq 0$; it is unstable (a source
or saddle point) iff $E^u \neq 0$. Again, if $E^c$ is not 0, linear stability does not in general
determine the stability type of $x^*$.

3.5  Topological  Equivalence  of  Map  Dynamics 

Given two cascades (map dynamics) defined by $F(x)$, $x \in X$, and $G(y)$, $y \in Y$,

Def. The $F$-cascade and $G$-cascade are topologically equivalent iff
there exists a homeomorphism $h: X \to Y$ such that $h \circ F = G \circ h$.  (3-7)

This is simpler than the definition (2-17) for a flow. This guarantees that every orbit of $F$
is mapped into one of $G$ in a continuous manner, and vice-versa. The $F$-cascade can be
continuously deformed into the $G$-cascade and vice-versa: they are not really different from
a topological point of view. In particular, the nonlinear cascade is topologically equivalent
to the linearized cascade around a hyperbolic fixed point.

3.6  Stable  and  Unstable  Manifolds  of  a  Fixed  Point 

These are defined by exactly the same language as those for flows, section 2.6, with the
substitutions $\phi_t(x) \to F^n(x)$ and $t \to n$. The Stable Manifold theorem reads the same.
N.B.: $W^s$ and $W^u$ for a map are manifolds (“continuous” sets like curves, surfaces, volumes,
etc., or technically: locally just like $\mathbb{R}^M$ for $1 \le M \le N$) even though the orbits are discrete
point sets. This, of course, is because we can start this hopping orbit anywhere in phase
space $\subset \mathbb{R}^N$. $W^s$ and $W^u$ for maps have the same properties as those detailed in section
2.6.1 for flows. But notice an important difference. A nonempty intersection $W^s \cap W^u$
for flows which is not a fixed point must be at least 1D since they must intersect in whole
orbits, which are 1D manifolds (curves). However, a nonempty $W^s \cap W^u$ for a map can
be 0D, a point $q$ which is not a fixed point, as depicted in Fig. 3.1.

Fig. 3.1 Transverse homoclinic point $q$.


Let us study this situation a little more because of its great importance in the subject of
“chaos.” We consider an orientation preserving map $F$, in $\mathbb{R}^2$ for simplicity, with hyperbolic
fixed point $x^*$ whose stable and unstable manifold curves $W^s$ and $W^u$ intersect transversely
in a nonfixed point $q$ (Fig. 3.1). Such a $q$ is called a transverse homoclinic point (to $x^*$
understood). Consider the orbit based at $q$,

$$\{\cdots, F^{-2}(q), F^{-1}(q), q, F(q), F^2(q), \cdots\}. \tag{3-8}$$

Then since $q \in W^s \cap W^u$ and these manifolds are both invariant, the infinite set of points
(3-8) must also lie in both $W^s$ and $W^u$. Therefore $W^s$ and $W^u$ must wind between each
other (must be “interleaved”) in a complicated way, intersecting at least in the infinite point
set (3-8). These intersections get closer and closer without limit because $F^n(q) \to x^*$ and
$F^{-n}(q) \to x^*$ as $n \to +\infty$ by the very definition of $W^s$ and $W^u$, respectively. Such an
orbit (3-8) is called a transverse homoclinic orbit, and the complicated geometry of the
interleaved, infinitely often intersecting $W^s$ and $W^u$ is named a homoclinic tangle. So Fig.
3.1 leads inevitably to Fig. 3.2.


Fig.  3.2  Homoclinic  tangle 

It can be proved that the map dynamics is chaotic in a well-defined sense in a neighborhood
of any transverse homoclinic point (see Wiggins, 1990, section 4.4, top p. 471). We will
return to this notion of chaos in Chap. 5.


3.7  Periodic  Orbits 

If we have a point $x_0$ such that $F^n(x_0) = x_0$ but $F^m(x_0) \neq x_0$, $0 < m < n$, we see that the $n$
points

$$x_1 = F(x_0), \; x_2 = F(x_1), \; \cdots, \; x_n = F(x_{n-1}) = x_0$$

form a periodic orbit of period $n$ (or $n$-cycle, or fixed point of order $n$). Each $x_i$ is a fixed
point of $F^n$: $F^n(x_i) = x_i$, $i = 0, 1, \cdots, n - 1$. The stability of the $n$-cycle can thus be
discussed in terms of the stability of the fixed points of the map $F^n$, a subject which we
have already covered, and is fully equivalent to it. Note that

$$DF^n(x_0) = DF(x_{n-1}) \cdots DF(x_1) DF(x_0)$$

by the chain rule (3-3) for $\mathbb{R}^N$, and $DF^n(x_i)$, $i = 1, \cdots, n - 1$, is the same product with its
factors cyclically permuted. Cyclic permutations of a matrix product have the same eigenvalues
(in 1D the products are simply equal), so every fixed point of $F^n$ belonging to an $n$-cycle has
the same eigenvalues. So we end up with the characterization of the stability of
an $n$-cycle in the hyperbolic case (no eigenvalue $\lambda$ of $DF^n(x_0)$ with $|\lambda| = 1$): the $n$-cycle
is asymptotically stable (an attractive $n$-cycle) if all $|\lambda| < 1$; it is unstable (a repelling or
saddle $n$-cycle) if some $|\lambda| > 1$. For the notion of convergence of an orbit to an attractive
periodic orbit $\gamma$: $F^n(x_0) \to \gamma$, $n \to +\infty$, see the remarks in section 2.8.

3.8  Bifurcation  of  Maps 

The motivating discussion of subsection 2.7.1 for flows applies here too. So it is clear that
the analogue of the definition given there is: there is a local bifurcation of the map at
the fixed point $x^*$ when an eigenvalue $\lambda$ of $DF_\mu(x^*)$ crosses the unit circle in the complex
eigenvalue plane. If it crosses at the bifurcation value $\mu_b$ of the parameter, then $|\lambda(\mu_b)| = 1$
and this value is isolated. The same statement applies for local bifurcations near periodic
points with the substitution of $F_\mu^n$ for $F_\mu$.

The definition of structural stability and of a general bifurcation for a map is the strict
analogue of (2-23), with the cascade $x_{n+1} = F_\mu(x_n)$ replacing the flow and the perturbations
$\delta_1 F(x)$, $\delta_2 F(x)$ of $F_{\mu_b}(x)$ replacing the perturbations of $f_{\mu_b}(x)$.

Ex. 3.3 Find fixed points, their stability, and bifurcation values (if any) for the
following real cascades. Draw the bifurcation diagrams.

a) $F_\mu(x) = \mu - x^2$, $\mu \in \mathbb{R}$.

b) $F_\mu(x) = \mu x(1 - x)$, $0 \le \mu \le 4$.

c) $(x, y) \mapsto (y, -x/2 + \mu y - y^3)$, $\mu > 0$. This is a cascade in $\mathbb{R}^2$.

3.9  One-Dimensional  Maps 

For 1D maps there are special techniques available: we mention in particular the graphical
technique for plotting the phase portraits. In the $xy$ plane draw the graph of the curve
$y = F(x)$ and the diagonal line $d: y = x$. Then an algorithm for constructing the orbit
is to draw the steps bounded by the line $d$ and the graph $y = F(x)$ as illustrated in Fig.
3.3. The projections of the points at which the steps meet the graph onto the $x$-axis then
obviously give the orbit $x, F(x), F^2(x), \cdots$.
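The staircase construction just described can be sketched in code. This is only an illustration of the algorithm (the function name is hypothetical): starting on the diagonal at $(x_0, x_0)$, each step moves vertically to the graph $y = F(x)$ and then horizontally back to the diagonal. The returned vertices can be passed to any plotting tool.

```python
def cobweb_path(F, x0, n):
    """Vertices of the staircase between the diagonal y = x and the graph y = F(x)."""
    path = [(x0, x0)]
    x = x0
    for _ in range(n):
        y = F(x)
        path.append((x, y))  # vertical step: meet the graph above/below x
        path.append((y, y))  # horizontal step: return to the diagonal
        x = y
    return path


if __name__ == "__main__":
    mu = 2.8  # sample parameter of the logistic map, chosen for illustration
    path = cobweb_path(lambda x: mu * x * (1.0 - x), 0.1, 20)
    # The x-coordinates of the points on the graph are the orbit x_0, x_1, ...
    orbit = [vertex[0] for vertex in path[1::2]]
    print(orbit[:5])
```

The odd-indexed vertices lie on the graph, so their projections onto the $x$-axis reproduce the orbit, as stated in the text.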

The intersections of the graph and the diagonal line $d$ give the fixed points of the map. If
$F^n$ instead of $F$ is plotted, these intersection points give the fixed points of order $n$, that is,
the points of the $n$-cycles. Fig. 3.3 shows the behavior of orbits near attractive or repelling
fixed points. Specializing the criterion of section 3.4 to $N = 1$, we have $\lambda = dF(x^*)/dx$;
and $x^*$ is attracting if $|\lambda| < 1$, repelling if $|\lambda| > 1$. But since $\lambda$ is geometrically the
slope of the graph at $x^*$, we can determine the stability of a fixed point (or of an $n$-cycle)
visually by noting whether the slope of the graph at the fixed point (or fixed point of $F^n$)
is less than or greater than unity in magnitude. This can be seen in the figure, where the
influence of $\mathrm{sgn}\,\lambda$ on the orbit is also evident. Also shown is the case $\lambda = 0$, where for this
particular map the orbit actually converges to $x^*$, though, as we remember, linear stability
analysis is powerless in this case.

Fig. 3.3 Graphical method illustrated. Top: attractive fixed points; bottom: repelling
fixed points.

3.10  The  Logistic  Map

We illustrate some of these topics, in particular bifurcation, on a famous and well-studied
1D map

$$F_\mu(x) = \mu x(1 - x), \quad 0 \le x \le 1, \quad 0 \le \mu \le 4. \tag{3-9}$$

See Bai-Lin (1984, section 14), Feigenbaum (1978). For the shown parameter range $F_\mu$ maps
the unit interval into itself, so that we can restrict our phase space to $0 \le x \le 1$. It is
marvelous that such a simple-looking dynamics (one-dimensional, only a quadratic map!)
has revealed an incredibly rich pattern of stability regimes, including “chaos,” various parts
of which show up in real nonlinear physical systems of the greatest complexity.

3.10.1  Period-Doubling 

The fixed points of (3-9) are $x_1^* = 0$, $x_2^* = (\mu - 1)/\mu$. The eigenvalue $\lambda = F_\mu'(x^*) =
\mu(1 - 2x^*)$, so $\lambda_1 = \mu$, $\lambda_2 = 2 - \mu$. Hence, in $0 \le \mu < 1$, $x_1^*$ is stable, $x_2^* \notin$ phase space.
(In section 3.10 let “stable” be short for asymptotically stable, since we shall discuss the
stability type only for hyperbolic regimes.) In $1 < \mu < 3$, $x_1^*$ is unstable and $x_2^*$ is stable.
Thus $\mu_1 = 1$ is the first (local) bifurcation value. That is, at $\mu_1 = 1$ both $|\lambda_1|$ and $|\lambda_2|$
passed through 1; $x_1^*$ lost stability and $x_2^*$ gained stability.

At $\mu = 3$, $x_2^* = 2/3$ has $\lambda_2 = -1$, so $x_2^*$ becomes unstable as $\mu$ increases through 3. So
$\mu_2 = 3$ is the second (local) bifurcation value. Since there are no further fixed points of
$F_\mu$, what happens? You can check that $F_\mu^2$ develops two new real fixed points $x_0$, $x_1 \in$
phase space, that is, which are not fixed points of $F_\mu$. For $\mu \approx 3.04$, slightly greater than
$\mu_2 = 3$, $x_0 = 0.5984\ldots$ and $x_1 \approx 0.7306\ldots$ Thus we have a 2-cycle. It is a stable 2-cycle
because

$$|F_\mu^{2\prime}(x_0)| = |F_\mu^{2\prime}(x_1)| = |F_\mu'(x_0) F_\mu'(x_1)| = \mu^2 |(1 - 2x_0)(1 - 2x_1)| \approx \mu^2 \times 0.0908 < 1$$

in $\mu_2 < \mu < \mu_3$ for some $\mu_3$ (the numerical factor 0.0908 is the value at $\mu \approx 3.04$).
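The numbers quoted for the 2-cycle at $\mu \approx 3.04$ can be reproduced directly. The sketch below uses the closed-form roots of $F_\mu^2(x) = x$ that are not fixed points of $F_\mu$, namely $x = [(\mu + 1) \pm \sqrt{(\mu - 3)(\mu + 1)}]/(2\mu)$; this formula is a standard result that is not derived in the text, and the function names are hypothetical.

```python
import math


def logistic(x, mu):
    """Logistic map (3-9)."""
    return mu * x * (1.0 - x)


def two_cycle(mu):
    """Points of the 2-cycle of the logistic map, valid for mu > 3."""
    root = math.sqrt((mu - 3.0) * (mu + 1.0))
    return (mu + 1.0 - root) / (2.0 * mu), (mu + 1.0 + root) / (2.0 * mu)


def two_cycle_multiplier(mu):
    """Derivative of F^2 on the cycle: F'(x0) * F'(x1), by the chain rule (3-3)."""
    x0, x1 = two_cycle(mu)
    return mu * (1.0 - 2.0 * x0) * mu * (1.0 - 2.0 * x1)


if __name__ == "__main__":
    mu = 3.04
    x0, x1 = two_cycle(mu)
    print(x0, x1, two_cycle_multiplier(mu))
```

Each point of the pair is mapped onto the other by $F_\mu$, and the multiplier has magnitude below 1, consistent with a stable 2-cycle.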

Here is the first example of a period-doubling bifurcation, here from a stable orbit of
period $1 = 2^0$ to a stable orbit of period $2 = 2^1$. Now continue to increase $\mu$: at $\mu_3 =
1 + \sqrt{6} \approx 3.4495\ldots$ the 2-cycle loses stability and a $4 = 2^2$-cycle gains stability. Namely,
$F_\mu^4$ develops four new fixed points $y_j$ (not fixed points of $F_\mu$, $F_\mu^2$, or $F_\mu^3$) and these are
stable: $|F_\mu^{4\prime}(y_j)| < 1$, $j = 0, 1, 2, 3$, while $|F_\mu^{2\prime}(x_i)| > 1$, $i = 0, 1$, in $\mu_3 < \mu < \mu_4$ for
some $\mu_4$. This is the second period-doubling bifurcation, from a stable $2^1$-cycle to a stable
$2^2$-cycle.
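The sequence of stable periods described above can be observed by brute-force iteration. The following sketch is an illustration only; the function name, transient length, and tolerance are arbitrary choices. It discards a transient and then looks for the smallest $p$ at which the orbit returns to its starting value.

```python
def logistic(x, mu):
    """Logistic map (3-9)."""
    return mu * x * (1.0 - x)


def attractor_period(mu, x0=0.3, transient=5000, max_period=64, tol=1e-9):
    """Smallest p with |x_{n+p} - x_n| < tol after a transient, or None."""
    x = x0
    for _ in range(transient):
        x = logistic(x, mu)
    reference = x
    for p in range(1, max_period + 1):
        x = logistic(x, mu)
        if abs(x - reference) < tol:
            return p
    return None


if __name__ == "__main__":
    for mu in (2.8, 3.2, 3.5):
        print(mu, attractor_period(mu))
```

The three sample values lie in the ranges $1 < \mu < 3$, $3 < \mu < \mu_3$, and $\mu_3 < \mu < \mu_4$ respectively. Close to a bifurcation value convergence becomes slow, so a longer transient would be needed there.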

As $\mu$ increases still further, there is a period-doubling cascade (“cascade” is here used in
the usual sense, not as short for “map dynamics”): stable $2^n$-cycle $\to$ stable $2^{n+1}$-cycle.


Fig. 3.4 Partial bifurcation diagram for the logistic map (not to scale)


Fig. 3.4 ends at $\mu_\infty$, the limit of the period-doubling bifurcation values $\mu_n$. We postpone the complete bifurcation diagram until we discuss a
few more topics.

Consider now the ratio of successive parameter intervals between period-doubling bifurcations.
The limiting ratio

$$\delta = \lim_{n \to \infty} \frac{\mu_n - \mu_{n-1}}{\mu_{n+1} - \mu_n} \approx 4.6692016091\cdots \tag{3-10}$$

does exist, so these parameter intervals decrease asymptotically as a geometric sequence.
The ratio $\delta$ is a universal number in the sense that it turns up in many, diverse physical
phenomena which show period-doubling behavior. See, for example, Chapter 8. Since the
convergence is rapid, (3-10) can be used to predict $\mu_{n+1}$, given $\mu_n$ and $\mu_{n-1}$, with fair
accuracy. For example,

$$\mu_4 \approx \mu_3 + \delta^{-1}(\mu_3 - \mu_2) \approx 3.45 + (3.45 - 3)/4.67 \approx 3.55,$$

which agrees with the actual value $\mu_4 \approx 3.544$ to within 0.01.
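The extrapolation based on (3-10) amounts to a one-line computation. A minimal sketch follows (the function name is hypothetical), using the values $\mu_2 = 3$ and $\mu_3 = 1 + \sqrt{6}$ from subsection 3.10.1 and the quoted value of $\delta$:

```python
import math

DELTA = 4.6692016091  # limiting ratio quoted in (3-10)


def predict_next_bifurcation(mu_prev, mu_curr, delta=DELTA):
    """Estimate mu_{n+1} from mu_{n-1} and mu_n, treating the ratio in (3-10) as exact."""
    return mu_curr + (mu_curr - mu_prev) / delta


if __name__ == "__main__":
    mu2 = 3.0
    mu3 = 1.0 + math.sqrt(6.0)
    print(predict_next_bifurcation(mu2, mu3))
```

Because the ratio of the first few intervals has not yet reached its limit, the estimate is only approximate; applying the same step repeatedly gives a rough estimate of the later $\mu_n$, with errors that accumulate from the early steps.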

3.10.2  Graphical  Treatment  of  Period-Doubling 


The graphical method of section 3.9 can be applied to elucidate the fundamental
geometrical reason for the period-doubling phenomenon. This adumbrates the ideas of
self-similarity, renormalization group, and fractals.


Fig.  3.5  Bifurcations  of  the  logistic  map  by  the  graphical  method 


The behavior described in subsection 3.10.1 can be read off from the slopes of the respective
functions at their intersections with the diagonal. The zero fixed point of $F_\mu$ is at first
stable. At $\mu = 1$ it goes unstable while the new, nonzero fixed point gains stability for the
range $1 < \mu < 3$. In the range $0 \le \mu < 3$ these are the only fixed points of $F_\mu^2$ in phase
space, and they have the same stability for $F_\mu^2$ as for $F_\mu$. Fig. 3.6 shows what happens for
$3 < \mu < \mu_3 \approx 3.45$. The iterated map $F_\mu^2$ suddenly gains two new fixed points at $\mu = 3$.
They are stable, and correspond to the stable 2-cycle of $F_\mu$ shown. The steps, constructed
by the graphical algorithm, close on themselves, the mark of a periodic orbit.


For $\mu$ around $\mu_3$ we get the situation of Fig. 3.7. The new fixed points of $F_\mu^2$ of Fig. 3.6
go unstable at $\mu = \mu_3$, while two stable 2-cycles of $F_\mu^2$ develop at the same time. The
latter correspond to four new fixed points of $F_\mu^4$, which are stable. They also correspond
to a stable 4-cycle of $F_\mu$. Thus history repeats itself; we can see the geometric origin of
the period-doubling cascade.

Note the similarity of the graphs of $F_\mu$ and $F_\mu^2$ in the neighborhood of unstable fixed points
and stable 2-cycles for each graph; that is, compare Figs. 3.6 and 3.7. Take the regions $V_1'$,
$V_2'$ enclosed by the dotted squares to the right and left of $x = 1/2$ in Fig. 3.7. If we magnify
$V_1'$ by a certain factor $\alpha$ and translate it to coincide with the region $V_1$ in the dotted square
of Fig. 3.6, the graphs of $F_\mu^2$ and $F_\mu$ practically coincide for the appropriate (different) parameter values in the two figures.

The same is true for $V_2'$ after an inversion in the point $(x, y) = (1/2, 1/2)$. And similarly
for all further period-doublings. The magnification factor $\alpha$ quickly approaches another
universal pure number. Thus a graph displaying all $F_\mu^{2^n}$, $n = 0, 1, 2, 3, \cdots$, together, for
appropriate parameter values in the ranges $\mu_{n+1} < \mu < \mu_{n+2}$, shows self-similarity
on all length scales. Such a set is called a fractal.

3.10.3  Tangent  Bifurcations 

We have seen that the slope of $F_\mu^{2^n}$ at one of its fixed points decreases through $-1$ at
$\mu = \mu_{n+2}$, a period-doubling bifurcation. This is a pitchfork bifurcation for $F_\mu^{2^{n+1}}$ (Fig.
3.8).

There is another very important local bifurcation for the logistic map, the tangent
bifurcation, which leads to intermittent behavior. Intermittency is defined generally as
regular behavior interrupted by random bursts of “chaotic” behavior. We can illustrate it
on the simple 1D map

$$G_\mu(x) = \mu + x - x^2$$

for $x$ and $\mu$ around zero (Fig. 3.9).

For $\mu \lesssim 0$ there is a narrow neck between the graph of $G_\mu$ and the diagonal line. The
orbit point spends a long time in this neck and is apparently converging to the false fixed
point $x = 0$. Once out of the neck, there may be “chaotic” behavior in the general system
which shows intermittency until the orbit point falls “by chance” into the entrance to the
neck again, and the regular behavior repeats. As $\mu$ increases through 0, the bifurcation
value for the model, the intermittent regime disappears, and a pair of fixed points, one
stable, one unstable, is born (Fig. 3.10).
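The long dwell time in the neck can be measured for the model map $G_\mu(x) = \mu + x - x^2$ with $\mu$ slightly negative. The sketch below is illustrative: the function name and the entry and exit points $\pm 0.5$ are arbitrary assumptions, and only the passage through the neck is modeled, not the reinjection that produces repeated bursts in a full system.

```python
def neck_passage_steps(mu, x_start=0.5, x_exit=-0.5, max_steps=10_000_000):
    """Number of iterations of G_mu needed to travel from x_start down past x_exit."""
    x = x_start
    steps = 0
    while x > x_exit and steps < max_steps:
        x = mu + x - x * x  # one application of the model map G_mu
        steps += 1
    return steps


if __name__ == "__main__":
    for mu in (-1e-4, -1e-6):
        print(mu, neck_passage_steps(mu))
```

Treating the iteration as the differential equation $dx/dn = \mu - x^2$ suggests a passage time of order $\pi/\sqrt{|\mu|}$, so making $|\mu|$ one hundred times smaller should lengthen the regular phase by roughly a factor of ten.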

This happens in the parameter regime $\mu_\infty < \mu < 4$ for the logistic map (3-9); pairs of
$m$-cycles of odd order $m = k2^n$, $k$ = odd integer, are born, preceded by intermittency. Thus
we are saying that for $\mu$ around one of these tangent bifurcation values, the dynamics
of the logistic map or one of its iterates $F_\mu^m$ is topologically equivalent to the dynamics
of the model map $G_\mu$, $\mu \approx 0$, in sufficiently small neighborhoods $X$ or $X_m$ of phase space
[0,1].

The full bifurcation diagram of the logistic map in $0 \le \mu \le 4$ is shown in Fig. 3.11.

We have explored only part of the stability structure of the regime $(\mu_\infty, 4]$ here; for more
details see the cited specialized works.

However, one further feature of this regime should be mentioned here: it is the “chaotic”
regime in the sense that the Lyapunov exponent $\chi > 0$. (We have already seen this for the
single value $\mu = 4$ in section 1.3.)

3.10.4  Lyapunov  Exponent  for  1D  Maps

Let $x_n = F^n(x_0)$ be an orbit of a 1D map $F$. For an initially nearby orbit we have
$x'_0 = x_0 + \epsilon$, $\epsilon$ small. Then

$$\epsilon_n = x'_n - x_n \approx (F^n)'(x_0)\,\epsilon = \epsilon \prod_{j=0}^{n-1} F'(x_j)$$

by the chain rule (3-3). Take the absolute value, write $|\epsilon_n| = e^{n\chi_n}|\epsilon|$, take the natural
logarithm, and divide by $n$. This defines $\chi_n$. Then we have the definition

$$\chi = \lim_{n\to\infty} \chi_n = \lim_{n\to\infty} \frac{1}{n} \sum_{i=0}^{n-1} \ln|F'(x_i)|. \tag{3-11}$$

If this limit exists, $\chi$ is called the Lyapunov exponent (LE) of the orbit based at $x_0$. This
can be generalized to maps and flows in $\mathbb{R}^N$ and even beyond phase spaces $\subset \mathbb{R}^N$, but the
generalization is not trivial. We postpone this to Chap. 5.

You see that $\chi$ is the average of the linearization growth exponent $\ln|F'(x)|$ along the
orbit. In fact, if $x_0$ is a fixed point $x^*$, then $\chi = \ln|\lambda| = \ln|F'(x^*)|$, and is > 0, = 0, or
< 0 according as $|\lambda|$ > 1, = 1, or < 1. If $\chi > 0$ for a general orbit, then it is diverging
(initially) at an exponential rate from any orbit which is sufficiently close to it at some
time $n$. This is a popular signature of sensitive dependence (SD) in the physics literature.
Note also that the LE of a stable $n$-cycle is < 0 (Ex. 3.4).
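The definition (3-11) translates directly into a finite-sum estimate. The sketch below assumes that the logistic map (3-9) has the usual form $F_\mu(x) = \mu x(1 - x)$, so that $F'_\mu(x) = \mu(1 - 2x)$; the function names, the initial value, and the numbers of transient and averaged iterates are illustrative choices only.

```python
import math

def logistic(mu, x):
    """Assumed form of the logistic map (3-9): F_mu(x) = mu * x * (1 - x)."""
    return mu * x * (1.0 - x)

def lyapunov_exponent(mu, x0=0.2, n_transient=1000, n=100_000):
    """Finite-n estimate of (3-11): average of ln|F'(x_i)| along the orbit of x0."""
    x = x0
    for _ in range(n_transient):      # let transients die out first
        x = logistic(mu, x)
    total = 0.0
    for _ in range(n):
        slope = abs(mu * (1.0 - 2.0 * x))
        total += math.log(max(slope, 1e-300))   # guard against log(0) at x = 1/2
        x = logistic(mu, x)
    return total / n

for mu in (1.5, 3.1, 3.5, 4.0):
    print(mu, lyapunov_exponent(mu))
```

For a stable fixed point or cycle the estimate comes out negative, and at $\mu = 4$ it should come out close to $\ln 2$, up to the statistical error of a finite average.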

Ex. 3.4 Prove that $\chi < 0$ for an (asymptotically) stable fixed point or periodic orbit.

Prob. 3.1 For the logistic map (3-9) and specific parameter values $\mu$ = 1.5, 3.1,
3.5, 3.5699, 3.6, 3.828, and 4, use a microcomputer or programmable calculator to do the
following:

a. Plot the orbit for each $\mu$ for a typical initial value $x_0$, $0 < x_0 < 1$, and a
sufficiently large number $n$ of iterates.

b. Try to guess the stability types of whatever attractors, if any, exist. (Hint: it
is essential that you plot the orbits, not just list the points in a computer printout, so as
to be able to identify the asymptotic [large $n$, after transients die out] behavior “by eye.”)
For example, “stable 3-cycle” might be an answer here.

c. Calculate the LE $\chi$ of each of these orbits, and try to correlate $\chi$ with the
observed stability type.

3.11  Poincare  Maps 

A  Poincare  map  of  a  flow  is  one  of  the  most  valuable  tools  for  studying  flows  in  nonlinear 
dynamics.  It  reduces  the  dimension  of  the  phase  space  by  at  least  1;  it  reduces  continuous 
orbits  to  discrete  orbits  (easier  to  compute);  nevertheless,  it  gives  a  complete  and  equivalent 
picture  of  the  stability  or  “chaotic”  behavior  in  sufficiently  small  neighborhoods. 

A  Poincare  map  is  commonly  used  to  study  flows  in  three  situations: 

• To study the stability of periodic orbits (limit cycles).
• To study time-periodic flows. These are nonautonomous flows such that $f(x,t) = f(x,t+T)$
for some $T$, all $x \in \mathbb{R}^N$. These usually arise from periodic forcing terms in
motion equations.

• To study the flow near a homoclinic or heteroclinic orbit. These are orbits which
join a hyperbolic fixed point $x^*$ to itself, or which join two hyperbolic fixed points $x_1^*$, $x_2^*$,
respectively. Such an orbit must therefore lie in the intersection $W^s \cap W^u$, where both
manifolds belong to $x^*$ in the homoclinic case, or one belongs to $x_1^*$, the other to $x_2^*$, in
the heteroclinic case. Such orbits are not usually structurally stable, so any slight change
of the parameter breaks them in two. “Chaotic” flow then ensues.

The  subject  of  Poincare  maps  is  a  huge,  mathematically  deep  one.  We  will  only  sketch  the 
key  idea  here,  and  assign  an  illuminating  problem.  For  the  theory,  see  GH  section  1.5,  or 
Wiggins  (1990,  section  1.2)  for  the  basics.  Most  of  their  succeeding  chapters  use  Poincare 
maps  as  an  essential  tool. 

Consider the application (1) above. Let a flow $\dot x = f(x)$ in $\mathbb{R}^N$, with orbit function now
written $\phi(t, x)$, have a periodic orbit $\gamma$ of period $T$. We pass an $(N-1)$-dimensional surface
$\Sigma$ through the flow, cutting $\gamma$ transversely at $x_0$. Take a smaller open set $V \subset \Sigma$ which
contains $x_0$. Then if $V$ and $\Sigma$ are small enough, the orbits which pass through $V$ will
intersect $\Sigma$ when they come around again, in a time close to $T$. The map that associates
points in $V$ with their points of first return in $\Sigma$ is called the Poincare map; call it $P$.
Then more precisely,

$$P : V \to \Sigma \quad \text{by} \quad x \mapsto \phi(\tau(x), x), \tag{3-12}$$

where $\tau(x)$ is the time of the first return of $x$ to $\Sigma$. Note $\tau(x_0) = T$ and $\phi(T, x_0) = x_0$.
Thus $P$ maps $x_0$ into $x_0$; $x_0$ is a fixed point of the Poincare map. $\Sigma$ is called a cross section
of the flow $f$. Figure 3.12 will make all of this clear.


Fig.  3.12  Illustrating  the  Poincare  map 


Practically speaking, one must have a pretty good idea of the flow phase portrait before
being able to choose a cross section intelligently. However, the choice of a particular $\Sigma$
satisfying the definition above is not too critical, because any two such maps, $P$ and $P'$,
are topologically equivalent!
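A first-return map of the kind (3-12) can be computed explicitly in a simple case. The sketch below uses a planar flow that is not from the text and is chosen only because everything can be checked by hand: in polar coordinates, $\dot r = r(1 - r^2)$, $\dot\theta = 1$, with the limit cycle $r = 1$ of period $2\pi$. The cross section is taken to be the half-line $\theta = 0$, so the return time is $2\pi$ for every point and only the radial equation needs to be integrated. The function names and the RK4 step count are illustrative.

```python
import math

def rk4_step(f, y, h):
    """One classical Runge-Kutta step for the scalar autonomous equation y' = f(y)."""
    k1 = f(y)
    k2 = f(y + 0.5 * h * k1)
    k3 = f(y + 0.5 * h * k2)
    k4 = f(y + h * k3)
    return y + h * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0

def radial_velocity(r):
    """Illustrative flow (an assumption, not from the text): r' = r (1 - r^2)."""
    return r * (1.0 - r * r)

def poincare_map(r0, steps=2000):
    """First-return map on the half-line theta = 0; return time is 2*pi since theta' = 1."""
    h = 2.0 * math.pi / steps
    r = r0
    for _ in range(steps):
        r = rk4_step(radial_velocity, r, h)
    return r

def poincare_map_exact(r0):
    """Closed form obtained by solving the radial equation for u = 1/r^2."""
    return (1.0 + (1.0 / r0**2 - 1.0) * math.exp(-4.0 * math.pi)) ** -0.5

print(poincare_map(0.5), poincare_map_exact(0.5))
```

Here $r = 1$ is a fixed point of $P$, and the slope of $P$ there is $e^{-4\pi} < 1$, so the limit cycle is stable: points of the section are pulled onto it almost completely in a single return.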

Now consider application (2) above. The nonautonomous flow is $\dot x = f(x,t)$, and obeys
$f(x,t+T) = f(x,t)$ for all $x$. Let $\phi_t(x_0)$ be the orbit function, with $\phi_0(x_0) = x_0$. Then
define the map $P$ by

$$P(x_0) = \phi_T(x_0), \quad \text{all } x_0 \in \mathbb{R}^N. \tag{3-13}$$

That is, we simply sample the flow at times $nT$, $n = 0, \pm 1, \pm 2, \pm 3, \ldots$, to define the
map orbit. For note $P^n(x_0) = \phi_{nT}(x_0)$. (Of course, this map can be used for autonomous
flows as well.) Unlike case (1), $P$ need not have a fixed point. If it does, say $P(x^*) = x^*$,
then $x^*$ corresponds to a periodic orbit of period $T$ for the flow. A fixed point of order
$n$: $P^n(x^*) = x^*$ corresponds to a subharmonic of the flow of period $nT$.

The map (3-13) is really of the Poincare type, as you can see by making the flow
autonomous by writing $\dot x = f(x,\theta)$, $\dot\theta = 1$; then the cross section $\Sigma$ is the $N$-dimensional
surface $\theta = 0$ cutting the flow transversely in the enlarged phase space $\mathbb{R}^N \times S^1$, where
$S^1$ is the unit circle (that is, $\theta$ is an angle and measured mod $2\pi$). Then the time of first
return $\tau(x) = T$ is the same for all $x \in \mathbb{R}^N = V = \Sigma$.

Prob. 3.2 Consider the harmonically forced, damped, Duffing oscillator

$$\ddot x + \delta \dot x - x + x^3 = \gamma \cos \omega t, \quad \delta \ge 0. \tag{3-14}$$

This problem is to be done on a programmable computer with graphics display. Take $\delta$ and
$\omega$ fixed, say $\delta$ = 0 or 0.25 and $\omega$ = 1, and consider $\gamma$ variable, $0 \le \gamma \le 1$.

a. Unforced case $\gamma = 0$. Put (3-14) into autonomous flow form in $\mathbb{R}^2$. Find the
fixed points, their invariant subspaces $E^u$, $E^s$, $E^c$, plot a phase portrait which indicates
also the invariant manifolds $W^s$, $W^u$ of each fixed point. Do this for $\delta$ = 0 and 0.25.

b. Now let $\gamma$ be nonzero: consider the cases $\gamma$ = 0.20, 0.30, 0.40, all with $\delta$ = 0.25.
Use the Poincare map $P_\gamma$ = (3-13) with $T = 2\pi/\omega = 2\pi$. Numerically plot $W^s$ and $W^u$
for the saddle fixed point of $P_\gamma$ near (0,0).

Remarks and hints on Prob. 3.2: This is a fairly ambitious problem. Part (a) can be
done more or less analytically with good intuition. As to part (b), luckily the map $P_\gamma$ is
structurally stable (section 3.8) at $\gamma = 0$, and so it continues to have three hyperbolic fixed
points near those of part (a) for small $\gamma$. To plot $W^s$ and $W^u$ numerically for the saddle
point $x^*$, determine the directions $E^s$ and $E^u$ and then iterate a number of initial points
on these directions close to $x^*$. (Remember that $W^s$ and $E^s$, $W^u$ and $E^u$ are tangent
at the fixed point by the Stable Manifold theorem, see section 3.6!) To get $W^s$, iterate
$P_\gamma^{-1}$; to get $W^u$, iterate $P_\gamma$. There is a (global) bifurcation somewhere between $\gamma$ = 0.10
and 0.20. You will find a transverse homoclinic orbit (section 3.6) developing when $W^s$
and $W^u$ intersect transversely, thus “chaotic” motion. For $\gamma$ large, $\approx$ 0.40 or greater, the
graphics should reveal what appears to be a strange attractor (see section 2.8; an obvious
analogous definition holds for maps). It apparently coincides with $W^u$. You may want to
compare your results with GH, section 2.2.
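As a possible starting point for part (b), here is a minimal sketch of the time-$T$ map (3-13) for the Duffing equation, assuming it reads $\ddot x + \delta\dot x - x + x^3 = \gamma\cos\omega t$ and is written as a first-order system in $(x, v)$ with $v = \dot x$. The integrator (classical fourth-order Runge-Kutta with a fixed step), the step count, and all function names are illustrative choices; plotting and the construction of $W^s$, $W^u$ are left out.

```python
import math

def duffing_rhs(t, x, v, delta, gamma, omega):
    """x' = v, v' = -delta*v + x - x^3 + gamma*cos(omega*t)."""
    return v, -delta * v + x - x**3 + gamma * math.cos(omega * t)

def poincare_map(x, v, delta=0.25, gamma=0.30, omega=1.0, steps=400):
    """Advance (x, v) from t = 0 over one forcing period T = 2*pi/omega with RK4."""
    h = 2.0 * math.pi / omega / steps
    t = 0.0
    for _ in range(steps):
        k1x, k1v = duffing_rhs(t, x, v, delta, gamma, omega)
        k2x, k2v = duffing_rhs(t + 0.5 * h, x + 0.5 * h * k1x, v + 0.5 * h * k1v,
                               delta, gamma, omega)
        k3x, k3v = duffing_rhs(t + 0.5 * h, x + 0.5 * h * k2x, v + 0.5 * h * k2v,
                               delta, gamma, omega)
        k4x, k4v = duffing_rhs(t + h, x + h * k3x, v + h * k3v, delta, gamma, omega)
        x += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        t += h
    return x, v

def orbit(x, v, n, **params):
    """Map orbit P^k(x, v), k = 1..n; the forcing is T-periodic, so P is the same each time."""
    points = []
    for _ in range(n):
        x, v = poincare_map(x, v, **params)
        points.append((x, v))
    return points

print(orbit(0.9, 0.0, 5, gamma=0.0))
```

To iterate $P_\gamma^{-1}$ one can integrate backward in time, for instance by using a negative step in the same scheme; that variant is not shown here.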
4. HAMILTONIAN SYSTEMS
4.1  Generalities 

The flow

$$\dot q = \partial H/\partial p, \quad \dot p = -\partial H/\partial q, \quad q, p \in \mathbb{R}^n, \tag{4-1}$$

in $\mathbb{R}^N$, $N = 2n$, is a Hamiltonian system of $n$ degrees of freedom. $H = H(q,p,t)$ is
called the Hamiltonian. If $H$ does not explicitly depend on $t$, the flow is autonomous, and
$H(q(t),p(t))$ is a constant (or integral) of the motion. In other words, $H$ is conserved. For
we can easily prove, using the motion equations (4-1), that

$$dH/dt = \partial H/\partial t, \tag{4-2}$$

which equals zero in the autonomous case. Then the flow is confined to the $(2n-1)$-dimensional
energy surfaces $H(q,p)$ = const. in phase space.
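The short calculation behind (4-2) may be worth writing out. It uses only the chain rule and the motion equations (4-1); the index $i$ over the $n$ degrees of freedom is the only notation added here.

```latex
\frac{dH}{dt}
  = \sum_{i=1}^{n}\left(\frac{\partial H}{\partial q_i}\,\dot q_i
      + \frac{\partial H}{\partial p_i}\,\dot p_i\right) + \frac{\partial H}{\partial t}
  = \sum_{i=1}^{n}\left(\frac{\partial H}{\partial q_i}\frac{\partial H}{\partial p_i}
      - \frac{\partial H}{\partial p_i}\frac{\partial H}{\partial q_i}\right)
      + \frac{\partial H}{\partial t}
  = \frac{\partial H}{\partial t}.
```

The two terms in each bracket cancel identically, so only the explicit time dependence of $H$ survives.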

Hamiltonian flows conserve volume in phase space, and are nondissipative, since $\nabla \cdot f = 0$.
Let us check this:

$$\nabla \cdot f = \sum_{k=1}^{2n} \frac{\partial f_k}{\partial x_k} = \sum_{i=1}^{n}\left(\frac{\partial^2 H}{\partial q_i \partial p_i} - \frac{\partial^2 H}{\partial p_i \partial q_i}\right) = 0, \tag{4-3}$$

whether or not $H$ depends on $t$. Hence they can have no low-dimensional attractors of
the dissipative type, where an initial nonzero volume in phase space shrinks in time to
zero volume (section 2.8). Hamiltonian flows are usually considered more fundamental
than dissipative flows, since microscopic physical motion equations like (4-1) possess
time-reversal invariance while dissipative flows do not. (Note the dissipative term $+\delta\dot x$ in the
damped Duffing oscillator (3-14), for example.) Dissipative systems in $\mathbb{R}^N$ are truncated
models of T-invariant systems in a larger phase space $\mathbb{R}^{N'}$, $N' \gg N$, where the interaction
between parts of the larger system is modelled as dissipative terms in a smaller system.

As for the question of stability, note that all stable fixed points of (4-1) must be centers
(the asymptotically stable, or sink, case is excluded). Refer to section 2.3 to recall these
definitions. We can see this as follows. Consider a stable fixed point $x^*$.

$$0 = \nabla \cdot f = \sum_{k=1}^{N} \frac{\partial f_k}{\partial x_k} = \mathrm{Tr}\, Df(x^*) = \sum_{k=1}^{N} \lambda_k \;\Longrightarrow\; \sum_{k=1}^{N} \mathrm{Re}\,\lambda_k = 0. \tag{4-4}$$

Here the $\{\lambda_k\}$ are the eigenvalues of the Jacobian matrix at the fixed point. But if $x^*$
is stable, no $\mathrm{Re}\,\lambda_k > 0$ (this seems evident, and can be proved). Therefore no $\mathrm{Re}\,\lambda_k < 0$
from (4-4), since the real parts must add to zero! Thus all $\mathrm{Re}\,\lambda_k = 0$, $k = 1, 2, \ldots, N$. This
says that $E^u = E^s = 0$; only $E^c \ne 0$. Then it can further be proved that this guarantees
that $x^*$ is in fact a center for most (“typical,” “nonpathological”) Hamiltonians. One
says: “under generic conditions.” So from now on we assume that all fixed points of a
Hamiltonian system are either centers or saddles.

Examples. A simple, important, autonomous Hamiltonian system for $n = 1$ is the linear
harmonic oscillator $H(q,p) = (p^2 + q^2)/2$, in properly scaled variables. There is one fixed
point (0,0), a center, and all orbits are periodic with the same period $\tau = 2\pi/\omega$, where
$\omega = \sqrt{k/m} = 1$ here.

A separatrix is a branch of $W^u(p)$ or $W^s(p)$ for a hyperbolic fixed point $p$ (Fig. 4.1).


Fig.  4.1  Two  cases  of  separatrices 

The general definition of hyperbolic fixed point was given in section 2.4; for a Hamiltonian
system there is only one possibility: $p$ is a saddle - the source and sink cases are out.
The harmonic oscillator has no separatrices since it has no saddles. However, another,
all-important example of an $n = 1$ Hamiltonian system does have separatrices. This is the
1D pendulum (Fig. 4.2):

$$\dot\theta = p, \quad \dot p = -a \sin\theta; \quad H(\theta,p) = p^2/2 - a\cos\theta. \tag{4-5}$$

Phase space here is actually $\mathbb{R} \times S^1$, $S^1$ = unit circle, since $\theta$ is an angle, measured mod $2\pi$.
Thus, topologically it is the surface of a cylinder rather than the plane $\mathbb{R}^2$. The different
fixed points are two, (0,0), a center, and $(\pi, 0)$, a saddle. For $(-\pi, 0)$ is to be identified
with $(\pi, 0)$, and $(2\pi, 0)$ with (0,0), etc. The phase portrait looks like Fig. 4.3.

Note the two separatrices, so-called because they separate regions of qualitatively different
orbits. Inside them the orbits are closed, going from approximately circular harmonic
oscillator orbits near the center to distorted limit cycles whose periods $\to \infty$ as they
approach the separatrices. (Why is this obvious?) This type of periodic motion is called
libration (def.: the orbit is closed in phase space). Outside the separatrices the orbits are
open curves. This type of motion is called rotation (def.: $p(q)$ is a periodic function). So
the harmonic oscillator shows only libration, while the pendulum shows both libration and
rotation. The geometric significance of these two is obvious. The pendulum librates when
its energy is low enough so that it swings back and forth like the bob of a grandfather
clock. When it has enough energy, it rotates around its pivot. The stable fixed point is
the pendulum hanging straight down, the unstable fixed point is the pendulum balanced
straight up. The separatrices are the limiting libration with angular amplitude $\pi$ and
infinite period.
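The distinction between libration and rotation is easy to see numerically. The sketch below assumes the pendulum in the form $\dot\theta = p$, $\dot p = -a\sin\theta$, $H = p^2/2 - a\cos\theta$, so that the saddle $(\pi, 0)$ and hence the separatrices lie at energy $H = a$. The leapfrog integrator, the step size, and the function names are illustrative choices; leapfrog is used here only because it keeps the energy error bounded over long runs.

```python
import math

def hamiltonian(theta, p, a=1.0):
    """Assumed pendulum Hamiltonian: H = p^2/2 - a*cos(theta)."""
    return 0.5 * p * p - a * math.cos(theta)

def leapfrog(theta, p, a=1.0, h=0.01, steps=5000):
    """Integrate theta' = p, p' = -a*sin(theta); theta is deliberately not wrapped mod 2*pi."""
    trajectory = [(theta, p)]
    for _ in range(steps):
        p -= 0.5 * h * a * math.sin(theta)   # half kick
        theta += h * p                       # drift
        p -= 0.5 * h * a * math.sin(theta)   # half kick
        trajectory.append((theta, p))
    return trajectory

def classify(theta0, p0, a=1.0):
    """Compare the conserved energy with the separatrix energy H(pi, 0) = a."""
    energy = hamiltonian(theta0, p0, a)
    if energy < a:
        return "libration"
    if energy > a:
        return "rotation"
    return "separatrix"

print(classify(1.0, 0.0), classify(0.0, 2.5))
```

In a librating run the unwrapped angle stays within its initial amplitude, while in a rotating run it grows without bound; on the cylinder both orbits are of course bounded.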

For  a  good  treatment  of  Hamiltonian  systems  see  Goldstein  (1980). 

4.2  Integrable  Systems 

There  are  three  presumably  equivalent  definitions  of  an  integrable  Hamiltonian  system. 
We  give  all  three,  since  each  one  casts  a  different  light  on  the  concept.  From  now  on 
we assume an autonomous system with analytic Hamiltonian $H(q,p)$. For real-valued
functions $f$ of real variables $x \in \mathbb{R}^N$, we have the

Def. $f$ is analytic in a neighborhood $V \subset \mathbb{R}^N$ if $f(x)$ admits
a power series expansion in $x$ about every point $x_0 \in V$. (4-6)

Thus we are assuming that $H$ has a power series expansion in $x = (q,p) \in \mathbb{R}^{2n}$ except
possibly at some singularities.

Def. 1. The system (4-1) is integrable if the solution curves $\phi_t(x_0)$
= $(q(q_0,p_0,t), p(q_0,p_0,t))$ are analytic in $x_0 = (q_0,p_0)$ and in $t$. (4-7)


This  is  the  most  fundamental  definition.  One  can  simply  integrate  the  differential  equations 
(4-1)  and  get  explicit  solutions  analytic  in  both  initial  conditions  and  in  time. 

Def. 2. The system (4-1) is integrable if there exist $n$ independent
analytic constants of the motion $F_i(q(t), p(t))$ = const., $i = 1, 2, \ldots, n$. (4-8)

There is already one constant of the motion, $H$ itself (see (4-2)). However, there must be
$n - 1$ further conserved quantities, and all independent (no $F_i$ is a function of the other $F_j$,
$j \ne i$) if the system is to be integrable. These usually arise when the system is not coupled
to external fields, which destroy the symmetry of $H$ under translations or rotations. Then
one gets total linear momentum or total angular momentum, etc., conserved as well as the
energy.


Def. 3. The system (4-1) is integrable if its Hamilton-Jacobi equation
admits a complete solution. (4-9)


This is more technical. The H-J equation is the single partial differential equation

$$H(q, \partial W/\partial q) = E = \text{const.}, \tag{4-10}$$

for Hamilton’s characteristic function $W$. A complete solution of (4-10),
$W(q_1, q_2, \ldots, q_n; \gamma_1, \gamma_2, \ldots, \gamma_n)$, admits $n$ independent integration constants $\gamma_i$. (The energy $E$ can be
chosen to be one of these.) We explain briefly how Eq. (4-10) arises, and refer the reader
to Goldstein (1980) for details.

Consider an analytical canonical transformation $Q_i = Q_i(q,p)$, $P_i = P_i(q,p)$, $i = 1, 2, \ldots, n$,
with new Hamiltonian defined by $K(Q,P) = H(q,p)$, that is, such that the Hamiltonian
form (4-1) is preserved:

$$\dot Q = \partial K/\partial P, \quad \dot P = -\partial K/\partial Q. \tag{4-11}$$

What if we could find a canonical transformation such that $K$ did not involve the $Q_i$:
$K = K(P)$? Then we could integrate (4-11) immediately and trivially, getting

$$P_i(t) = \gamma_i = \text{consts.}, \quad Q_i(t) = \alpha_i t + \beta_i; \quad \alpha_i, \beta_i \ \text{consts.}, \tag{4-12}$$

where $\alpha_i(\gamma) = (\partial K/\partial P_i)|_{P=\gamma}$. How would we find such a canonical transformation? If
it were generated by a function $W(q,P)$, we would have

$$p = \partial W/\partial q, \quad Q = \partial W/\partial P$$

(see Goldstein, 1980). We could get a partial differential equation for $W$ by rewriting
$H(q,p) = K(\gamma) = E$ = const. as in (4-10). Then it can be shown that a complete solution
of (4-10) guarantees that the canonical transformation $(q,p) \to (Q,P)$ can be inverted to
get the solution

$$q_i = q_i(\alpha t + \beta, \gamma), \quad p_i = p_i(\alpha t + \beta, \gamma), \quad \alpha, \beta, \gamma \in \mathbb{R}^n,$$

from (4-12). The $2n$ independent integration constants $\beta$, $\gamma$ can be replaced by the $2n$
initial values $q_{0i}$, $p_{0i}$, and we are back to the explicit analytic solutions demanded by Def.
1, (4-7). This makes a long story short, but I hope that it gives the essential idea without
oversimplifying.

You  may  well  ask  why  we  don’t  simply  solve  the  H-J  equation  for  the  system  of  interest, 
thereby  apparently  getting  explicit  orbit  solutions  and  proving  any  system  integrable  by 
Def.  1.  The  answer  is,  a  complete  solution  of  the  H-J  equation  does  not  always  exist ; 
in  fact,  it  is  rare  (a  fact  not  sufficiently  emphasized  in  our  standard  classical  mechanics 
texts).  In  those  cases  the  system  is  nonintegrable.  A  famous  example  is  the  Newtonian 
3-body  problem,  which  worried  Newton  and  all  his  successors. 

Some  simple  examples  of  integrable  systems: 

a. All 1D systems ($n = 1$). For, solve for $p = \phi(q)$ from $H(q,p) = E$. Substitute this
into the motion equation $\dot q = \partial H/\partial p$, getting a first order ordinary differential equation
for $q$. Integrate.

b. All linear flows. For, go to normal modes (that is, separate the variables), getting
the equations $\ddot q_i + \omega_i^2 q_i = 0$, $i = 1, 2, \ldots, n$. Integrate.

4.2.1 Action-Angle Variables


Let us consider a special case of an integrable system. First we assume that the H-J
equation is completely separable in some set of canonical coordinates $q$, $p$. This means that
it reduces to the $n$ ordinary differential equations

$$H_i(q_i, \partial W_i/\partial q_i) = E_i = \text{consts.}, \quad \text{where} \quad H = \sum_i H_i(q_i, p_i). \tag{4-13}$$

Then one gets the solution $W = \sum_i W_i(q_i, \alpha)$ [$\alpha \in \mathbb{R}^n$ are some general set of $n$ independent
constants, functions of the separation constants $E_i$] by quadratures (solve the $i$th ordinary
differential equation in (4-13) for $\partial W_i/\partial q_i$ as a function of $q_i$ and $E_i$, and integrate).
Second, we assume that (at least in some energy regimes) the motion is of libration or
rotation type. This means, more precisely, that the projection of the motion on the
$(q_i, p_i)$ 2-plane for each $i = 1, 2, \ldots, n$, is libration or rotation as defined for the 1D pendulum in
section 4.1. We confine attention to these special integrable systems from now on.

Now we can introduce action-angle variables $J$, $\theta$, a special case of the separable coordinates
of (4-13). Define the actions

$$J_i = \oint p_i\,dq_i = \oint \frac{\partial W_i}{\partial q_i}(q_i, \alpha)\,dq_i = J_i(\alpha), \tag{4-14}$$

where the integral $\oint$ is carried out over a complete period of the libration or rotation.
Inverting (4-14), we can express the $\alpha_i$ as functions of $J \in \mathbb{R}^n$. Since $K(\theta, J) = H(q,p) =
\sum_i H_i = \sum_i E_i$, $K$ is a function $K(J)$ of the $J$ only. Then the canonical motion
equations (4-11) have the simple solutions (4-12). Namely,

$$\dot\theta_i = \partial K/\partial J_i = \nu_i(J) = \text{consts.} \;\Longrightarrow\; \theta_i(t) = \nu_i t + \beta_i;$$
$$\dot J_i = -\partial K/\partial \theta_i = 0 \;\Longrightarrow\; J_i(t) = J_i = \text{consts.} \tag{4-15}$$

Because of the definition (4-14) of the actions, these particular coordinates have special,
nice properties. We just list them, and refer the interested reader to Goldstein (1980,
Chap. 10) for proofs and details.

a. The $\nu_i = \partial K(J)/\partial J_i$ are the frequencies of the libration or rotation.

b. The bounded motion, that is, libration, $x(t) = (q(t), p(t))$ is quasi-periodic with
the frequencies $\nu_i$:

$$x(t) = \sum_j a_j \exp[2\pi i\, j \cdot (\nu t + \beta)], \quad \nu, \beta \ \text{consts.} \in \mathbb{R}^n. \tag{4-16}$$

Here $j \cdot \nu = \sum_i j_i \nu_i$ is the scalar product, $j$ is an integer vector, namely, its $n$ components
are integers, $a_j$ are a set of constant amplitudes in $\mathbb{R}^{2n}$ indexed by integer vectors, and the
sum in (4-16) goes over all integer vectors. We only mention that the unbounded motion
(rotation) is similar, except that it has an extra term $q_0 \cdot (\nu t + \beta)$ linear in $t$ in $q(t)$.

Property (a) has been a boon in celestial mechanics, since one can compute the fundamental
frequencies just by finding $K(J)$, without solving for the complete motion. As for (b),
note that the motion is in general not periodic, since the orbits do not close. Only in the
case that the frequencies are commensurable,

$$j_r \cdot \nu = 0, \quad r = 1, 2, \ldots, n-1 \quad \text{(commensurable case)} \tag{4-17}$$

for $n - 1$ linearly independent constant integer vectors $j_r$, do all orbits close. This is easy
to see: it means that the ratio of any two frequencies is a rational number. A special case is
that all frequencies are integer multiples of some one frequency $\nu_0$ (which need not belong
to the set $\nu_1, \nu_2, \ldots, \nu_n$). Then the orbit (4-16) is a Fourier series in time with fundamental
frequency $\nu_0$.

Prob. 4.1 Introduce action-angle variables and solve for the frequency $\nu = dK(J)/dJ$ for

a. the linear harmonic oscillator,

b. the 1D pendulum (4-5). You may want to look at Goldstein (1980) for hints.
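A numerical cross-check for an analytic answer to part (b) can be set up as follows. This is only a sketch under stated assumptions: the pendulum Hamiltonian is taken as $H = p^2/2 - a\cos\theta$, the action is $J(E) = \oint p\,d\theta$ over one libration (no factor $1/2\pi$, following (4-14)), and since $K(J) = E$ the frequency is $\nu = dK/dJ = 1/(dJ/dE)$. The substitution $\theta = \theta_{max}\sin\varphi$, the midpoint rule, the finite-difference step, and the function names are illustrative choices.

```python
import math

def action(energy, a=1.0, n=4000):
    """J(E) = closed integral of p dtheta over one libration, for -a < E < a."""
    theta_max = math.acos(-energy / a)          # turning point, where p = 0
    dphi = math.pi / n
    total = 0.0
    for k in range(n):
        phi = -0.5 * math.pi + (k + 0.5) * dphi  # midpoint rule in phi
        theta = theta_max * math.sin(phi)        # removes the square-root endpoint behavior
        p = math.sqrt(max(0.0, 2.0 * (energy + a * math.cos(theta))))
        total += p * theta_max * math.cos(phi) * dphi
    return 2.0 * total                           # branches p > 0 and p < 0

def frequency(energy, a=1.0, d_energy=1e-5):
    """nu = dK/dJ = 1 / (dJ/dE), by a central difference in E."""
    d_action = action(energy + d_energy, a) - action(energy - d_energy, a)
    return 2.0 * d_energy / d_action

for energy in (-0.999, -0.9, 0.0, 0.9, 0.99):
    print(energy, frequency(energy))
```

Near the center ($E \to -a$) the result should approach the harmonic value $\sqrt{a}/2\pi$, and it should fall toward zero as $E \to a$, in line with the infinite period on the separatrices.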
4.2.2  N-Tori 

The moral of Eq. (4-16) is that the bounded motion of integrable systems lies on invariant
$n$-tori. For, once we locate an orbit initially in phase space by the $2n$ numbers $q_0$, $p_0$,
or equivalently, by the $2n$ numbers $\theta_0$, $J_0$, the further position is determined uniquely by
$n$ angles $\theta_i(t)$ (see (4-15)). $n$-dimensional manifolds where the point is specified by $n$
angles are called $n$-tori. For instance, a 1-torus is a circle; a 2-torus is the surface of a
doughnut (Fig. 4.4). For $n \ge 3$, the $n$-torus cannot be realized in our physical 3-space,
but is defined similarly. The $n$ actions $J_i = J_{0i}$, which are constant along the orbit, fix
the particular $n$-torus on which the orbit lies: this torus is invariant. In the general case
$\nu_i$ incommensurable ((4-17) not true), the generic orbit winds up on the torus endlessly
without closing, and is dense there (as seems reasonable, and can be proved). Only in
the commensurable case is every orbit periodic, and therefore not dense on the torus. For
“dense,” see below.
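The difference between the two cases on a 2-torus can be made concrete by watching one angle each time the other completes a turn. In the sketch below the angles are measured in units of full turns, `ratio` stands for $\nu_2/\nu_1$, and the circle is divided into bins; the bin count, the number of returns, the small starting offset, and the function name are illustrative assumptions. A finite computation like this only suggests density, it does not prove it.

```python
import math

def visited_bins(ratio, n_returns=2000, n_bins=100, theta2_start=0.005):
    """Count the bins of the theta_2 circle visited at successive returns of theta_1.

    ratio = nu_2 / nu_1, so theta_2 advances by `ratio` turns per return of theta_1.
    """
    bins = set()
    for k in range(n_returns):
        theta2 = (theta2_start + k * ratio) % 1.0
        bins.add(min(int(theta2 * n_bins), n_bins - 1))
    return len(bins)

golden = (math.sqrt(5.0) - 1.0) / 2.0
print(visited_bins(2.0 / 5.0))   # commensurable frequencies: the orbit closes
print(visited_bins(golden))      # incommensurable frequencies: the orbit keeps filling in
```

With a rational ratio the orbit returns to a finite set of points no matter how long one waits; with an irrational ratio every bin is eventually visited, however fine the subdivision.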

But this just means that the dynamics of any such integrable system is topologically
equivalent to the dynamics of $n$ uncoupled 1D pendula! Indeed, if we pass 2-surfaces
through the phase space, they cut the flow in distorted copies of the pendulum phase
portrait (Fig. 4.3). This explains the fundamental importance of the pendulum in
Hamiltonian dynamics.

This long and rather abstract dose of theory is justified by the pay-off. First, integrable
systems can never be “chaotic.” For the motion is at most quasi-periodic, and this is too
regular for the appellation “chaotic.” Secondly, there are implications for the foundations of
statistical mechanics. If the microscopic dynamics of nature were integrable, the necessary
ergodicity assumptions of statistical mechanics could never be valid. For integrable motion
is confined to low-dimensional manifolds, and so could never wander ergodically through
the whole energy surface, sampling all states on it, as required by statistical mechanics.
That is, $n < 2n - 1$ unless $n = 1$. For a preliminary definition we can say that an orbit
is ergodic on the energy surface if it is dense there: given any state $x \in$ energy surface $\sigma_E$
and any neighborhood $V \subset \sigma_E$ of $x$ no matter how small, the orbit passes through $V$ at
some time (or times).

At  this  point  it  is  natural  to  entertain  the  notion  that  nonintegrable  microscopic  dynamics 
will save statistical mechanics. Viz., the nice clean integrable Hamiltonians of our
textbooks  will  be  perturbed  by  realistic  interactions  into  nonintegrable  ones.  The  motion 
will  then  be  generically  ergodic,  energy  sharing  among  the  various  modes  (a  mode  is  one 
of  the  terms  in  (4-16)  characterized  by  an  integer  vector  j)  will  take  place,  equipartition 
will  set  in  in  equilibrium,  etc.,  and  all  the  properties  assumed  in  statistical  mechanics  will 
follow.  To  examine  this  hopeful  view  we  now  turn  to  nonintegrable  systems. 

4.3  Nonintegrable  Systems 

A  nonintegrable  system  is  simply  one  which  is  not  integrable.  Unfortunately,  there  is  no 
simple  criterion,  such  that  we  need  only  to  look  at  a  Hamiltonian  to  see  if  it  is  integrable 
or  nonintegrable.  The  only  way  is  to  prove  that  the  system  violates  Definition  1,  2,  or 
3  of  section  4.2.  But  there  is  no  guarantee  that  we  can  do  this.  Consequently,  we  often 
work  with  Hamiltonians  which  we  suspect  (from  numerical  work,  say)  to  be  nonintegrable, 
without being able to prove it.

In  this  section,  we  enter  an  area,  called  “dynamical  systems,”  on  which  much  research 
has  been  done  over  many  years,  unlike  most  work  on  nonlinear  dynamics,  which  is 
comparatively new. Some deep and powerful theorems have been proved by great
mathematicians. These results are too technical to give in rigorous form in this report. But
they  are  so  important  that  they  must  be  included  here,  even  if  in  nonrigorous,  paraphrased 
form.  My  aim,  as  always  in  this  report,  is  that  the  definitions  and  theorems  should  make 
sense  to  the  reader. 

First, we might ask: how many integrable Hamiltonians are there among the set of all
analytic Hamiltonians? The answer is that they are rare, a set of measure 0 in an
appropriate measure. Nonintegrable Hamiltonians are dense in the set in the same measure.

Second, what is the nature of the flow for a nonintegrable Hamiltonian? The most
important result on this is the famous KAM (Kolmogorov, Arnold, Moser) theorem, see
Arnold (1978), Arnold and Avez (1968), GH (1983, section 4.8), and Wiggins (1990, pp.
150-153).

4.3.1  KAM  Theorem 

This  can  be  stated  loosely  as  follows: 

KAM Theorem. Given $H = H_0 + V$ nonintegrable with $H_0$ integrable
(of separable and rotation-libration type). Introduce action-angle
variables $\theta$, $J$ for the unperturbed system with Hamiltonian
$H_0(J)$. Assume a) $V$ is “small” in some sense, and b) $H_0(J)$ is
nondegenerate,

$$\frac{\partial(\nu_1, \nu_2, \ldots, \nu_n)}{\partial(J_1, J_2, \ldots, J_n)} \ne 0, \qquad \nu_i = \frac{\partial H_0}{\partial J_i},$$

where the left hand side is the Jacobian determinant. Then for a
sufficiently irrational set of frequencies $\nu^* \in \mathbb{R}^n$ there exists an
invariant torus $T(\nu^*)$ of the perturbed system $H$ close to the
invariant torus $T_0(\nu^*)$ of the unperturbed system $H_0$. (4-18)

Remarks. a) Frequencies $\nu^*$ are sufficiently irrational if $\nu^* \cdot m$ is bounded away from zero
for all nonzero integer vectors $m \in \mathbb{Z}^n$. More precisely,

$$|\nu^* \cdot m| \ge \alpha \|m\|^{-\beta} \tag{4-19}$$

for some fixed $\alpha$, $\beta$ and all $m$, where $\|m\|$ is the length of vector $m$. Thus not only can
no integer linear combination of the $\nu^*$ vanish, as in (4-17), but these must actually not
be too small, in the sense of (4-19). b) Sufficiently near a center of $H$ (section 2.3) the
majority of the orbits of $H$ lie on these “preserved” tori, called KAM tori (sometimes
nonresonant, or irrational, tori). This is the principal moral of the KAM theorem. On
perturbing an integrable system into a nonintegrable one, we do not completely destroy the
regular motion; part of the motion remains confined to low ($n$)-dimensional manifolds
and quasi-periodic. c) The KAM theorem says nothing about the fate of tori $T_0(\nu)$ whose
frequencies are not sufficiently irrational, that is, about the orbits of $H$ which do not lie on
the preserved tori $T(\nu^*)$. Might the motion in this complement set be “ergodic” in some
sense? See the remarks on statistical mechanics at the end of section 4.2.
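Condition (4-19) can be explored numerically for two frequencies. The sketch below computes the smallest value of $|\nu \cdot m|\,\|m\|^{\beta}$ over nonzero integer vectors with bounded components; a value well above zero is consistent with (4-19) for that $\beta$, while a value of zero signals a resonance of the type (4-17). The search bound, the choice $\beta = 1$, the example frequency vectors, and the function name are illustrative assumptions, and a finite search of this kind cannot establish (4-19) for all $m$.

```python
import math
from itertools import product

def diophantine_margin(nu, beta, max_component=50):
    """Minimum of |nu . m| * ||m||**beta over nonzero integer m with |m_i| <= max_component."""
    best = math.inf
    for m in product(range(-max_component, max_component + 1), repeat=len(nu)):
        if not any(m):
            continue                          # skip m = 0
        dot = abs(sum(v * k for v, k in zip(nu, m)))
        norm = math.sqrt(sum(k * k for k in m))
        best = min(best, dot * norm**beta)
    return best

golden = (1.0 + math.sqrt(5.0)) / 2.0
print(diophantine_margin((1.0, golden), beta=1.0))   # stays away from zero
print(diophantine_margin((1.0, 0.5), beta=1.0))      # resonant: m = (1, -2) gives zero
```

The golden-mean frequency ratio is the standard example of a “sufficiently irrational” one: its margin does not shrink as the search bound is increased.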

4.3.2  Poincare-Birkhoff  Theorem 

To investigate the dynamics in this complement set we consider the fate of a rational torus
$T_0(\nu_r)$ of the unperturbed flow, that is, one whose frequencies are commensurable (4-17),
so that any orbit on it is periodic, and closes. We can define a Poincare map $P_0$ (section
3.11) by cutting this torus transversely by a $(2n-2)$-dimensional cross section $\Sigma$ lying in
the $(2n-1)$-dimensional relevant energy surface $H_0 = E$. Let $\Sigma$ cut the torus in the level
curve $\Gamma$. By a simple theorem, every point of $\Gamma$ is a fixed point of $P_0^m$ for some fixed $m$; we
choose the minimum such positive integer $m$. Now turn on the perturbation $V$; the same
section $\Sigma$ will define a Poincare map $P$ for the perturbed flow near $T_0(\nu_r)$ in the energy
surface $H = E$. The question is, what happens to $T_0(\nu_r)$, or equivalently, to $\Gamma$?

For a Hamiltonian map $F$ we have $\det DF(x) = 1$ everywhere, the counterpart of $\nabla \cdot f = 0$
for flows, because it conserves volume in phase space. Hence its fixed points of any order,
$F^m(x^*) = x^*$, can be proved to be either centers or saddles in the generic case, cf. the
similar discussion for Hamiltonian flows following (4-4). For a map of $\mathbb{R}^2$ the situation
is especially simple. Let $DF^m(x^*)$ have eigenvalues $\lambda_1$, $\lambda_2$, which thus satisfy $\lambda_1 \lambda_2 = 1$.
There are only two possibilities: $0 < \lambda_1 < 1 < \lambda_2$, or $\lambda_2 = \bar{\lambda}_1$, $|\lambda_i| = 1$. The fixed point is
called a hyperbolic point in the first case (saddle point) and an elliptic point in the second
case (center).
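Because $\lambda_1 \lambda_2 = 1$, the eigenvalues of a $2 \times 2$ Jacobian are fixed by its trace alone, so the classification reduces to one comparison. The sketch below illustrates this. The Chirikov standard map used as the test case is an assumption of mine (it is not discussed in this section), and all function names are hypothetical.

```python
import cmath
import math


def classify_fixed_point(jacobian, tol=1e-9):
    """Classify a fixed point of an area-preserving map of the plane from
    the 2x2 Jacobian DF^m(x*). Returns (kind, lambda_1, lambda_2)."""
    (a, b), (c, d) = jacobian
    det = a * d - b * c
    if abs(det - 1.0) > tol:
        raise ValueError("Jacobian is not area preserving (det != 1)")
    trace = a + d
    # Eigenvalues solve lambda**2 - trace*lambda + 1 = 0.
    root = cmath.sqrt(trace * trace - 4.0)
    lam1 = (trace - root) / 2.0
    lam2 = (trace + root) / 2.0
    if abs(trace) < 2.0 - tol:
        kind = "elliptic"      # complex-conjugate pair on the unit circle
    elif abs(trace) > 2.0 + tol:
        kind = "hyperbolic"    # real pair lambda, 1/lambda (negative if trace < -2)
    else:
        kind = "degenerate"    # lambda_1 = lambda_2 = +1 or -1
    return kind, lam1, lam2


def standard_map_jacobian(theta, k):
    """Jacobian of the (assumed) standard map
    p' = p + k*sin(theta), theta' = theta + p', in the ordering (p, theta)."""
    return ((1.0, k * math.cos(theta)),
            (1.0, 1.0 + k * math.cos(theta)))


if __name__ == "__main__":
    for theta in (0.0, math.pi):   # the two fixed points with p = 0
        print(theta, classify_fixed_point(standard_map_jacobian(theta, 1.0)))
```

For $k = 1$ the fixed point at $\theta = 0$ has trace 3 and is hyperbolic, while the one at $\theta = \pi$ has trace 1 and is elliptic.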

At this point we specialize to two degrees of freedom, $n = 2$. Then we are dealing with
a 3D energy surface, a 2D cross section $\Sigma$, a 2-torus, a 1D $\Gamma$ (an ordinary curve), and
Poincare maps $P_0$ and $P$ taking $\mathbb{R}^2$ into $\mathbb{R}^2$. Now we are ready to state the main theorem
in a paraphrased form. For rigorous statements, see Arnold and Avez (1968), GH (1983,
section 4.8), Wiggins (1990, pp. 137-140).

Poincare-Birkhoff Theorem. For a sufficiently small perturbation $V$,
the level curve $\Gamma$ “breaks up” into $2km$ fixed points of $P^m$ for
some integer $k$. These fixed points lie near $\Gamma$; $km$ are elliptic
and $km$ are hyperbolic. (4-20)


This situation is depicted in Fig. 4.5. $\Gamma$ is often called a resonant level curve, and a certain
region around $\Gamma$ containing the elliptic and hyperbolic points is called a resonance zone.
Also shown are the separatrices joining two adjacent hyperbolic points. If they intersect
transversely in a homoclinic point as shown, this leads to the infinitely complicated set of
intersections known as a homoclinic tangle (or stochastic layer in the physics literature),
which was discussed in section 3.6 for a single fixed point. Incidentally, the integer $k$ is
not predicted by the P-B theorem. For the situation $km = 3$ of Fig. 4.5, $k = 3$, $m = 1$
or $k = 1$, $m = 3$. The regions around the elliptic points bounded by the separatrices are
called islands. Figure 4.5 shows an “island chain” of three islands in the case $m = 3$ or a
set of three “separate” islands in the case $m = 1$.

Take any elliptic point $E$ of $H$. In any neighborhood of $E$ there is a resonant torus $T_0(\nu'_R)$
of $H_0$ (Arnold and Avez, 1968). We have indicated such an elliptic point $E$ with a nearby
resonant level curve $\Gamma'$ in Fig. 4.5. By the P-B theorem this curve breaks up in turn (into
$k'm'$ new elliptic points $E'$ and $k'm'$ new hyperbolic points). Figure 4.5 can represent this
breakup by the substitutions $E \to E'$ and $\Gamma \to \Gamma'$, the resonant level curve associated with
$T_0(\nu'_R)$. But now the P-B theorem can be applied to the resonant tori which lie near the $E'$,
producing a breakup of those resonance zones, and so on. Thus there is an infinitely nested
structure within any island.

Fig. 4.5 Illustrating the Poincare-Birkhoff Theorem. (c) Enlargement of one of the islands in (b),
showing the homoclinic tangle.

Further, in any neighborhood of an elliptic point of $H$ there are nonresonant, or preserved,
tori of $H$, as mentioned in section 4.3.1. Thus there is both regular motion (on the preserved
tori) and “chaotic” motion (in the stochastic layers in the resonance zones) arbitrarily near
any elliptic point, which corresponds, as we remember, to a stable periodic orbit of $H$.

For the generalization to $n > 2$, see the cited works.


4.3.3  Resonance  Overlap 

The above discussion of the consequences of the P-B theorem (which Poincare himself
found mind-boggling) does not exhaust the complications, the possibilities of even more
“chaos.” If $V$ is large enough, the resonance zones may overlap, defined precisely as the
transverse intersection of stable and unstable manifolds $W^s(x_1)$ and $W^u(x_2)$ for $x_1$ and
$x_2$ hyperbolic points from two different resonance zones. This is illustrated in Fig. 4.6.

Then the inescapable argument of section 3.6 implies infinitely many transverse intersections,
thus transverse heteroclinic orbits, and heteroclinic tangles.

There are theories which offer quantitative methods for handling this P-B breakup of
resonance zones, in particular Melnikov’s method (see GH, 1983, sections 4.5 and 4.6, and
Wiggins, 1990, section 4.5). For further developments, especially resonance overlap, see
Chirikov (1979).

4.4  Examples  of  Nonintegrable  Systems 
Consider first the “double resonance” Hamiltonian

$$H = H_0(J_1, J_2) + f_{mn} \cos(m\theta_1 - n\theta_2) + g_{pq} \cos(p\theta_1 - q\theta_2)$$ (4-21)

in action angle variables $J_i$, $\theta_i$, $i = 1, 2$; $m$, $n$, $p$, $q$ are positive integers. For $f_{mn} \neq 0$
and $g_{pq} \neq 0$, $H$ is presumably nonintegrable, and if they are small enough, the KAM
and Poincare-Birkhoff theorems can be applied. Construction of a Poincare map $P$ and
explicit numerical integration reveal the following features (Walker and Ford, 1969).

For low energies $E$ the phase portrait shows the general appearance of the superposition
of two resonance zones corresponding to frequencies close to the commensurable ones $\nu_R$
and $\nu'_R$ satisfying

$$m\nu_{R1} - n\nu_{R2} = 0, \qquad p\nu'_{R1} - q\nu'_{R2} = 0.$$ (4-22)

Here the resonant tori $T_0$ with frequencies $\nu_R$ and $\nu'_R$ correspond to the two integrable
Hamiltonians (4-21) with $g_{pq} = 0$ or $f_{mn} = 0$ respectively, whose 2-tori can be plotted
analytically. Most tori are preserved. As $E$ is raised, one sees the breakup of these
resonance zones in accordance with the P-B theorem, the appearance of islands, traces of
homoclinic stochastic layers, etc. For sufficiently high $E$ resonance overlap occurs. One
notes heteroclinic stochastic layers and single orbits which seem to wander over the whole
energy surface. The KAM tori disappear.

Consider now the Henon-Heiles Hamiltonian (Henon and Heiles, 1964)

$$H = \tfrac{1}{2}(p_1^2 + p_2^2) + \tfrac{1}{2}(q_1^2 + q_2^2) + q_1^2 q_2 - \tfrac{1}{3} q_2^3.$$ (4-23)

This is presumably nonintegrable. A Poincare map was constructed by taking $\Sigma$ to be
the section $q_1 = 0$, $\dot{q}_1 > 0$ of the 3D energy surface. The map was plotted numerically for
various $E \le 1/6$, for which the motion is bounded.
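The construction of such a Poincare map can be sketched numerically. The code below is an illustration under my own assumptions, not the procedure of the cited authors: it uses a fixed-step fourth-order Runge-Kutta integrator, detects upward crossings of $q_1 = 0$ (where $\dot{q}_1 = p_1 > 0$), and locates each crossing by linear interpolation. The step size and all function names are hypothetical choices.

```python
import math


def energy(state):
    """Henon-Heiles Hamiltonian for state = (q1, q2, p1, p2)."""
    q1, q2, p1, p2 = state
    return (0.5 * (p1 * p1 + p2 * p2) + 0.5 * (q1 * q1 + q2 * q2)
            + q1 * q1 * q2 - q2 ** 3 / 3.0)


def rhs(state):
    """Hamilton's equations: dq/dt = dH/dp, dp/dt = -dH/dq."""
    q1, q2, p1, p2 = state
    return (p1, p2, -q1 - 2.0 * q1 * q2, -q2 - q1 * q1 + q2 * q2)


def rk4_step(state, h):
    """One classical fourth-order Runge-Kutta step of size h."""
    def shifted(k, factor):
        return tuple(x + factor * h * dx for x, dx in zip(state, k))
    k1 = rhs(state)
    k2 = rhs(shifted(k1, 0.5))
    k3 = rhs(shifted(k2, 0.5))
    k4 = rhs(shifted(k3, 1.0))
    return tuple(x + h * (a + 2.0 * b + 2.0 * c + d) / 6.0
                 for x, a, b, c, d in zip(state, k1, k2, k3, k4))


def start_on_section(e_total, q2, p2):
    """Initial state on the section q1 = 0 with p1 > 0 at energy e_total."""
    p1_squared = 2.0 * e_total - p2 * p2 - q2 * q2 + 2.0 * q2 ** 3 / 3.0
    if p1_squared < 0.0:
        raise ValueError("(q2, p2) is outside the energy surface")
    return (0.0, q2, math.sqrt(p1_squared), p2)


def poincare_points(e_total, q2, p2, n_points, h=0.01, max_steps=200000):
    """Collect (q2, p2) at successive upward crossings of q1 = 0."""
    state = start_on_section(e_total, q2, p2)
    points = []
    for _ in range(max_steps):
        new = rk4_step(state, h)
        if state[0] < 0.0 <= new[0]:             # q1 crosses zero going up
            w = -state[0] / (new[0] - state[0])  # linear interpolation weight
            points.append((state[1] + w * (new[1] - state[1]),
                           state[3] + w * (new[3] - state[3])))
        state = new
        if len(points) >= n_points:
            break
    return points, state


if __name__ == "__main__":
    pts, final = poincare_points(1.0 / 12.0, 0.1, 0.0, 20)
    print(pts[:3], energy(final))
```

Plotting the returned $(q_2, p_2)$ pairs for several initial conditions at one energy gives a section of the kind described here; a linear interpolation of the crossing is crude, and a production code would refine it.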

In general one sees preserved tori and broken-up resonance zones. In particular, a) $E =
1/12$. Only 2-tori are seen. b) $E = 0.106$. One sees several large islands and two smaller
8-island chains. There are signs of a stochastic layer near separatrices. c) $E = 0.125$.
Some preserved tori. A random splatter of points from a single orbit, possibly ergodic. d)
$E = 1/6$. No visible 2-tori. Irregular, possibly ergodic orbits. See Fig. 4.7.

Fig. 4.7 Poincare map orbits for the Henon-Heiles system

5. MEASURES OF CHAOS
5.1 Introduction

“Chaos” was already introduced in an informal way in Chap. 1, where we saw that although
there is no consensus on a precise definition yet, three or four criteria are usually accepted in
the physics literature as “signatures of chaos.” The core idea of “chaos,” however, as most
would agree, is sensitive dependence, often abbreviated SD hereafter. We will concentrate
on SD in this chapter, and treat the other signatures much more briefly.

The meaning of sensitive dependence (on initial conditions, that is, initial point $x_0$ in phase
space, understood hereafter) is the unpredictability of the orbits, even though the dynamics
is of course a deterministic system. Say we have a system which does possess SD (not all
of them do!) and we wish to compute the orbit $\phi_t(x_0)$ based at $x_0$. No matter how closely
we approximate $x_0$, it is in principle possible (for certain $x_0$) to make a fixed gross error
in the orbit after some time $t$: the calculated orbit bears no resemblance to the true orbit
$\phi_t(x_0)$ after this time. The term “unpredictable” seems justified for this property. The
necessity of approximating $x_0$ is forced upon us by inevitable small errors, by the finite
resolution of our computers, etc. This situation will not be cured by an increase of $10^3$
or $10^6$ or ... in the power of our supercomputers of the future. Also, it has nothing to do with
noise, errors which arise from perturbations originating outside the system (although noise will
compound the error); it is an intrinsic property of the deterministic system.

There  are  two  versions  of  SD  current  today  -  the  mathematician’s  and  the  physicist’s, 
which,  strangely  enough,  seem  to  coexist  without  being  aware  of  each  other.  The 
mathematician’s  definition  focuses  on  this  basic  idea  of  unpredictability,  the  minimal, 
irreducible  idea,  such  that  anything  weaker  would  be  admitted  as  regular,  predictable 
motion by everybody. It is something very like the familiar idea of the continuity
or discontinuity of functions at a point generalized to apply to whole orbits. The
mathematician’s definition does not concern itself with how fast orbits separate, or how
long it takes to make this “fixed, gross error.” The physicist’s definition on the other hand
requires fast (exponential) separation of orbits in a certain sense; we shall call it exponential
SD. This is partly because there is an easy way (in principle) to check this fast SD, namely
by the calculation of Lyapunov exponents. But the main reason, I suspect, is the widely
held belief that exponential SD is necessary to ensure “chaos,” since the mathematician’s
SD,  call  it  minimal  SD,  is  known  to  include  cases  of  “trivial  SD,”  where  the  behavior  is 
clearly  not  “chaotic,”  for  example,  the  free  particle.  However,  this  belief  is  mistaken,  as 
we  shall  see.  Minimal  SD  occurs  on  certain  sets  in  phase  space;  it  is  natural  to  limit  these 
sets  in  a  certain  way.  Then  minimal  SD,  so  limited,  is  essentially  equivalent  to  chaos;  it 
has  all  the  properties  that  we  associate  with  that  term.  So  chaos  can  finally  be  defined  this 
way.  There  is  no  need  to  mention  the  rate  of  divergence  of  orbits  in  the  definition,  and  no 
reason  to  believe  that  all  chaotic  SD  is  exponential!  The  distinction  between  exponential 
and  minimal  SD  is  irrelevant,  a  false  distinction,  as  far  as  chaos  goes. 

One advantage of the minimal definition of SD over the exponential one is that analytic
proofs of its presence or absence are easier to carry out in some systems, notably QM
(quantum mechanics) systems. If minimal SD can be shown to be
absent, then there is no chaos in that system. Proofs that chaos is absent in a very
large class of interesting observables in QM will be given or indicated in this section or in
appendix B.

A  very  interesting  and  apparently  open  question  (July  1990)  is  whether  chaos  defined  this 
way  implies  the  other  usually  accepted  “signatures  of  chaos.” 

5.2  Sensitive  Dependence 

Let $x$ be a point of phase space, or state, of the system. For flows and maps whose phase
space $\subset \mathbb{R}^N$, a state is the real $N$-tuple $(x_1, x_2, \ldots, x_N)$. But the state need not be so
limited. For example, in QM $x$ is the state vector $\psi$, sometimes written $|\psi\rangle$, a “point”
in an $\infty$-dimensional complex vector space $\mathcal{H}$. The essential thing is that there should
be a distance $d(x_1, x_2)$ between any two states $x_1$, $x_2$, that is, state space can be any
metric space $M$. For $\mathbb{R}^N$ we could take $d(x_1, x_2) = \|x_1 - x_2\| = [\sum_{i=1}^{N} (x_{1,i} - x_{2,i})^2]^{1/2}$, the
euclidean metric. For QM state space $\mathcal{H}$, where $x_1 = \psi_1$, $x_2 = \psi_2$,

$$d(x_1, x_2) = \|\psi_1 - \psi_2\|, \quad \text{where } \|\phi\|^2 = \langle\phi|\phi\rangle,$$ (5-1)

and $\langle\psi|\phi\rangle$ is the scalar product in Hilbert space $\mathcal{H}$. We can continue to call $\phi_t(x_0)$ the
orbit in this general context.

The basic (minimal, irreducible) idea of sensitive dependence at state $x \in M$ is that there
exist states $x'$ arbitrarily close to $x$ such that the $x'$ eventually separate from $x$ by at least
some fixed finite distance $\epsilon$ independent of $x'$ under time evolution. As emphasized in
section 5.1, this is it! We do not require exponentially fast separation of $x(t)$ and $x'(t)$.
Since this concept is so important, let us put it into a precise definition.

Def. The dynamics $D$ with orbit function $\phi_t$ has sensitive
dependence on initial conditions at state $x$ if there exists an
$\epsilon > 0$ such that, for any neighborhood $N$ of $x$, there exists
an $x' \in N$ and a $t > 0$ such that $d(\phi_t(x'), \phi_t(x)) > \epsilon$. (5-2)

The set of all such $x$ is called the Julia set $J$ of the dynamics $D$. This concept is very old,
as nonlinear dynamics goes, going back to Julia and Fatou, who worked on the iteration
of complex analytic maps in France around 1920. It is clear that Def. (5-2) guarantees
the fixed error $\epsilon$ in the orbit through an $x \in J$, as discussed qualitatively in section 5.1.
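Definition (5-2) can be illustrated, though not proved, by a numerical experiment. The sketch below assumes the logistic map in the form $x \to \mu x(1-x)$ on the unit interval with $d(x', x) = |x' - x|$; the helper names and the sampled offsets are hypothetical. For shrinking initial offsets it records the first iteration at which the two orbits differ by more than a fixed $\epsilon$.

```python
def first_separation_time(step, x, x_prime, eps, max_iter):
    """Iterate a map on both initial states; return the first n with
    |F^n(x') - F^n(x)| > eps, or None if none occurs within max_iter."""
    for n in range(1, max_iter + 1):
        x, x_prime = step(x), step(x_prime)
        if abs(x_prime - x) > eps:
            return n
    return None


def logistic(mu):
    """Return the map x -> mu * x * (1 - x) for a given parameter mu."""
    return lambda x: mu * x * (1.0 - x)


if __name__ == "__main__":
    for delta in (1e-4, 1e-8, 1e-12):
        # mu = 4: every sampled neighbor eventually separates by eps = 0.1.
        print(delta, first_separation_time(logistic(4.0), 0.3, 0.3 + delta, 0.1, 500))
        # mu = 2.5: both orbits converge to the stable fixed point 0.6.
        print(delta, first_separation_time(logistic(2.5), 0.3, 0.3 + delta, 0.1, 500))
```

Note that the same $\epsilon$ works for every offset in the first case; only the waiting time grows as the offset shrinks. A finite sample of neighbors is of course weaker than the "any neighborhood" of the definition.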

There arises a vexing question here. There may be functions $F$ of the state $x$ which are at
least as interesting as $x$ itself. For QM, where the state $x = \psi$, it is sufficient to mention
expectation values of Hermitian observables, the time correlation of $\psi(t)$ and $\psi(t + \tau)$,
etc. We could define SD for the observable $F$ simply by using $d(F(\phi_t(x')), F(\phi_t(x)))$ in
(5-2), where $d$ is the distance in the metric space $M'$ of the function $F : M \to M'$. The
question arises: does the SD of the dynamical system $D$ depend on the function, or map,
$F$ in question? One can prove that it does not if $F(x)$ is continuous (Ingraham, 1988).
Assuming continuous maps $F$ always, we restrict to the orbit itself in this report.

But now notice that the so-defined SD does not always imply what we want to mean by
chaos. A simple example: consider a free particle, moving on a straight line for simplicity.
The phase space has points $x = (\xi, v)$, $v = \dot{\xi}$, and orbits $\xi(t) = vt + \xi_0$, $v(t) = v =$ const.
Take the euclidean norm $d(x', x) = [(\xi' - \xi)^2 + (v' - v)^2]^{1/2}$. Then it is clear that for any
initial point $x = (\xi, v)$ and different point $x' = (\xi', v')$ with $v' \neq v$, we have

$$d(\phi_t(x'), \phi_t(x)) \ge |v' - v|\,t - |\xi' - \xi|.$$

Hence, no matter how close $x'$ is to $x$, $d >$ any given $\epsilon$ for some time $t$; if $\epsilon = 1$, $d > 1$
for $t = (1 + |\xi' - \xi|)/|v' - v|$. So the dynamics has SD at every point in phase space; every point
belongs to the Julia set. Similarly, many integrable systems show this “trivial SD” on
their unbounded orbits.

There is a very natural way to limit the SD (5-2) to ensure that it is equivalent to chaos.
Like all good definitions, it eschews technicalities and a list of special cases; it achieves the
desired concept at a single inspired stroke (Wiggins, 1990, hereafter abbreviated as W).
First, let us define SD for sets $A$ with an $\epsilon$ the same for all $x \in A$. Second, let us require
that the set $A$ be invariant and compact. “Invariant” we already know from Chap. 2;
it means that $A$ is carried into itself by the time-evolution, so that SD is not a transient
property. Compact means that $A$ is closed and bounded. Making all of this precise, we
end up with the definition

Def. The dynamics $D$ with orbit function $\phi_t$ has sensitive
dependence on the invariant compact set $A$ if there exists an
$\epsilon > 0$ such that for any $x \in A$ and any neighborhood $N$ of $x$
there exists an $x' \in N$ and a $t > 0$ such that $d(\phi_t(x'), \phi_t(x)) > \epsilon$. (5-3)

There is another, independent feature of “chaotic” motion on a set $A$, namely that the set
be mixed up properly.

Def. For a dynamics $D$ with orbit function $\phi_t$, a set $A$ is topologically
transitive if, for any pair of open sets $U_1, U_2 \subset A$, the image of $U_1$
under the dynamics intersects $U_2$ at some time: $\phi_t(U_1) \cap U_2 \neq \emptyset$
for some time $t > 0$. (5-4)

Following Wiggins (1990) and Devaney (1986), we require both (5-3) and (5-4) for true chaos.

Def.  The  dynamics  D  is  chaotic  on  an  invariant,  compact  set  A  if  D 

has  sensitive  dependence  and  is  topologically  transitive  on  A.  (5  —  5) 


For  convenience  of  expression  we  can  also  say  that  A  is  chaotic  in  the  case  (5-5).  Thus  we 
have  finally  a  precise  definition  of  chaos  in  any  dynamical  system  (and  we  can  drop  the 
quotation  marks  in  “chaos”  when  referring  to  this  definition). 

Sensitive dependence on sets (5-3) and chaos can be generalized to maps of the orbit or
“observables” as noted above. Again one gets the same SD and chaos with respect to the
map as with respect to the orbit itself if $F : M \to M'$ is continuous.

Requiring SD on compact sets in phase space as in (5-3) for chaos rather than just pure
SD (5-2) excludes those cases of “trivial SD” from classification as chaos. For our free
particle example above, the orbits are clearly unbounded, so could never be contained in
a compact set $A$.

5.2.1  Lyapunov  Stability 

It is interesting to get an expression of regular (non-sensitively dependent) motion by
negating the minimal definition (5-2) of SD. This reads

Def. The dynamics $D$ is Lyapunov stable at state $x$ if, given $\epsilon > 0$,
there exists a $\delta > 0$ such that if $d(x', x) < \delta$, then $d(\phi_t(x'), \phi_t(x)) < \epsilon$
for all $t > 0$. (5-6)

Cf. W, page 6, def. 1.1.1. In the old work on iteration of complex analytic maps $F(z)$,
$z \in \mathbb{C}$, mentioned before, such a map $F$ was said to be normal at $z$. Def. (5-6) is the
nearest thing to continuity that we can define for whole orbits. Comparing (5-6) with the
previous definition of the stability of a fixed point in section 2.3, we see that the latter
is none other than Lyapunov stability specialized to the degenerate orbit $x(t) = x^*$ if we
define neighborhoods via the metric $d$ in the usual way. So (5-6) is now seen as the logical
extension of (2-13a) to whole orbits.

5.2.2  Maximal  Lyapunov  Exponent 

The Lyapunov exponents $\chi_1, \chi_2, \ldots, \chi_N$ for a map or a flow in $\mathbb{R}^N$ can be defined, but this
involves some tricky technicalities which don’t arise if $N = 1$ (Def. (3-11) for maps). See
GH (1983). So we shall skip that in this report and instead define the maximal Lyapunov
exponent $\chi$ for our general phase space $M$ = any metric space. Let the dynamics have the
orbit function $\phi_t$.

Def. $$\chi(x) = \lim_{t \to \infty} \sup_{x' \in N(x)} \frac{1}{t} \ln \|D\phi_t(x)(x' - x)\|$$ (5-7)

if this limit exists for all sufficiently small neighborhoods $N(x)$. $D\phi_t(x)$ is the Jacobian
matrix of $\phi_t$ at $x$ for $\mathbb{R}^N$ or its generalization for more general phase spaces $M$ such as
Hilbert space $\mathcal{H}$ for QM. In (5-7) the Jacobian matrix acts on the vector $x' - x$, and we
take the length $\|\cdots\|$ of the resulting vector. Sup means least upper bound, here over
points $x'$ in sufficiently small neighborhoods of $x$. Thus we are varying the directions of
the small vectors on which the Jacobian matrix acts and taking the maximal vector length
so found. A numerical algorithm for computing $\chi$ is given by Benettin et al. (1976).

A little reflection reveals that $\chi(x)$ measures the maximal rate of exponential divergence
of the orbit based at $x$ from other infinitesimally near orbits. $\chi(x) > 0$ for an orbit will be
defined as exponential SD. Now what we expect intuitively is $\chi(x) \le 0$ if the orbit through
$x$ is Lyapunov stable. I know of no formal proof of this, but it seems very plausible and
we shall accept it here. So $\chi(x) > 0$ for some $x$ implies the violation of Lyapunov stability
(5-6), so that its negation, minimal SD (5-2), is valid. Exponential SD is therefore just a
special case of minimal SD.
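For a one-dimensional map the Jacobian of the $n$-th iterate is, by the chain rule, the product of the slopes $F'(x_i)$ along the orbit, so the maximal exponent reduces to an orbit average of $\ln|F'(x_i)|$. The sketch below uses this reduction. It is not the algorithm of Benettin et al., and it assumes the logistic map in the form $x \to \mu x(1-x)$; the function names, transient length and guard value are my own choices.

```python
import math


def lyapunov_exponent_1d(f, dfdx, x0, n_steps, n_transient=1000):
    """Estimate chi = (1/n) * sum_i ln|f'(x_i)| along the orbit of x0
    for a one-dimensional map f with derivative dfdx."""
    x = x0
    for _ in range(n_transient):        # discard the initial transient
        x = f(x)
    total = 0.0
    for _ in range(n_steps):
        slope = abs(dfdx(x))
        total += math.log(max(slope, 1e-300))   # guard against ln(0)
        x = f(x)
    return total / n_steps


def logistic_pair(mu):
    """Return the logistic map and its derivative for parameter mu."""
    return (lambda x: mu * x * (1.0 - x)), (lambda x: mu * (1.0 - 2.0 * x))


if __name__ == "__main__":
    for mu in (2.5, 3.5, 4.0):
        f, df = logistic_pair(mu)
        print(mu, lyapunov_exponent_1d(f, df, 0.3, 100000))
```

At $\mu = 4$ the estimate should come out close to $\ln 2$ (exponential SD), while at $\mu = 2.5$ the orbit sits on a stable fixed point with slope $-1/2$ and the estimate is close to $-\ln 2$.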
The motion in homoclinic and heteroclinic tangles has been studied carefully by
mathematicians (GH, 1983, Secs. 5.2, 5.3; W, 1990, Secs. 4.3, 4.4, 4.8) and proved to possess
SD on compact invariant sets $A$, (5-3). These sets $A$, called Cantor sets, are really weird
and intuition-defying. They are closed, totally disconnected, and perfect (every point is a
limit point). See the mathematical glossary in GH for these terms. In fact, the motion
is topologically equivalent to a certain “shift map” on the space of bi-infinite sequences of
$N$ symbols. This symbolic dynamics can be proved to be chaotic according to (5-5) (see
W, section 4.2). However, to my knowledge, it is unknown whether this extremely chaotic
motion shows exponential SD.

5.2.3  Sensitive  Dependence  and  Chaos  in  Quantum  Mechanics 


The  question  of  whether  there  is  “chaos”  in  QM  is  one  of  intense  interest  at  the  present 
moment  (1990),  hotly  disputed,  with  claims  and  counterclaims  compounding  a  confused 
situation.  Part  of  the  trouble  is,  of  course,  the  unsettled  nature  of  the  definition  of  chaos, 
implicit  in  my  use  of  quotation  marks  around  this  term  in  most  of  this  report.  However, 
with  chaos  clearly  defined  as  in  (5-5),  rigorous  proofs  that  there  is  no  chaos  in  many 
important  observables  of  QM  can  be  constructed  (Ingraham,  1988).  We  treat  this  topic 
briefly  in  this  section  and  give  such  a  proof  in  appendix  B. 


Consider the self-correlation $C(t, \tau) = \langle\psi(t)|\psi(t + \tau)\rangle$ or the auto-correlation functional of
it

$$C(\tau) = \lim_{T \to \infty} \frac{1}{T} \int_0^T C(t, \tau)\, dt.$$ (5-8)

These have been computed numerically for simple model systems and plotted both in $t$ or
$\tau$ space and, Fourier-transformed, as power spectra. Decaying behavior in $\tau$ and broad
band power spectra, “signatures of chaos,” have been seen (Milonni et al. 1987, section
52; Pomeau et al. 1986).


However, we can prove that there is no chaos (5-5) in these systems by showing that there
is no minimal SD (5-2) in these systems, that is, that they are Lyapunov stable at every
state $\psi \in \mathcal{H}$. The proof can be done for any QM system: any dimension of $\mathcal{H}$, forced
or not, etc. One way would be to show that $C(t, \tau)$ is a continuous function of the orbit
$\phi_t(x)$ = the state vector $\psi(t)$ in the Schrodinger Picture, in accordance with the theorem
on observables $F$ mentioned above. For the state vector itself almost trivially has no SD.
We show that it is Lyapunov stable.

Given the orbit $\psi(t)$, let $\psi'(t)$ be an orbit initially near the first one: $\|\psi'(0) - \psi(0)\| < \delta$,
where $\|\cdots\|$ is the usual QM norm defined in (5-1). Let $U(t, 0)$ be the unitary
time-development operator. Then at any time $t$

$$\|\psi'(t) - \psi(t)\| = \|U(t, 0)(\psi'(0) - \psi(0))\| = \|\psi'(0) - \psi(0)\| < \delta$$ (5-9)

because $U(t, 0)$ is unitary and so does not change the norm of any state. Hence (5-6) is
satisfied with $\delta = \epsilon$, Q.E.D.
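The norm preservation used in this argument is easy to check numerically in the smallest nontrivial case. The sketch below assumes a two-level system with Hamiltonian $\omega \sigma_x$ (in units with $\hbar = 1$), for which the time-development operator has the closed form $\cos(\omega t)\,I - i \sin(\omega t)\,\sigma_x$. The model and the function names are illustrative assumptions, not taken from the text.

```python
import math


def evolve(psi, omega, t):
    """Apply U(t, 0) = cos(omega*t) * I - i * sin(omega*t) * sigma_x
    to a two-component state psi = (a, b); U is unitary for real omega, t."""
    c, s = math.cos(omega * t), math.sin(omega * t)
    a, b = psi
    return (c * a - 1j * s * b, -1j * s * a + c * b)


def distance(psi1, psi2):
    """QM metric of (5-1): the norm of the difference of two state vectors."""
    return math.sqrt(sum(abs(u - v) ** 2 for u, v in zip(psi1, psi2)))


if __name__ == "__main__":
    psi = (1.0 + 0.0j, 0.0j)
    small = 1e-3
    scale = math.sqrt(1.0 + small * small)
    psi_near = (1.0 / scale, small / scale)    # normalized neighbor of psi
    for t in (0.0, 0.5, 3.0, 50.0):
        print(t, distance(evolve(psi_near, 1.3, t), evolve(psi, 1.3, t)))
```

The printed distance stays at its initial value for all times, up to rounding, in contrast with the growing separations seen in a classical system with SD.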

But it is probably easier to prove directly that $C(t, \tau)$ is Lyapunov stable. See appendix
B. From no SD for $C(t, \tau)$, no SD for $C(\tau)$ in $\tau$ follows immediately.

Another map of great importance is the expectation value

$$\langle q \rangle(t) = \langle\psi(t)|\, q\, |\psi(t)\rangle$$ (5-10)

of observable $q$ at time $t$. For example, the occupation probability of energy eigenstate $\phi_n$
at time $t$, $|\langle\phi_n|\psi(t)\rangle|^2$, is the expectation value (5-10) for operator $q = |\phi_n\rangle\langle\phi_n|$ =
the projection onto the state $\phi_n$. These have been plotted for model systems, both as time
series and power spectra, in the literature, and have appeared “chaotic.” See, for example,
Milonni et al. (1987, section 52). However, it can be proved that $\langle q \rangle(t)$ for any QM system
and any bounded operator shows no SD and hence, no chaos.

Def. Operator $q$ is bounded if $\|q\phi\| \le M\|\phi\|$, all $\phi \in \mathcal{H}$, (5-11)

for some constant $M > 0$. Since this proof is similar to the one for $C(t, \tau)$, we leave it as
a problem for the reader.

Prob. 5.1. Prove that $\langle q \rangle(t)$ has no SD for $q$ bounded. (Hint: see appendix B).

It begins to look as if one can prove analytically that there is no chaos in QM! But the
situation is not yet clear because of the existence of unbounded operators $q$ in QM: (5-11)
not true, or equivalently, their spectra unbounded above or below or both. These can’t
be ignored: a few examples are position and momentum of free particles or oscillators,
energies of $n$-electron atoms, orbital angular momentum, etc. The role that unbounded
operators play in possible chaos in QM is an open question (July 1990).

5.3  Discrete  Fourier  Transform  and  Power  Spectrum 

Say we have a set of numbers $x_j$, $j = 0, 1, 2, \ldots, N-1$, where $N \gg 1$. They might be
obtained from sampling a flow at equally spaced times $j\,\Delta t$ over a time $T = N \Delta t$, or from
a map dynamics $x_j = F^j(x_0)$ over a large number $N$ of iterations. Define the discrete
Fourier transform of this set by

$$\hat{x}_k = \frac{1}{\sqrt{N}} \sum_{j=0}^{N-1} x_j\, e^{-2\pi i jk/N}, \qquad k = 0, 1, 2, \ldots, N-1.$$ (5-12)

$\hat{x}_k$ corresponds to frequency $\nu_k = k/T$. The inverse is

$$x_j = \frac{1}{\sqrt{N}} \sum_{k=0}^{N-1} \hat{x}_k\, e^{2\pi i jk/N}, \qquad j = 0, 1, 2, \ldots, N-1,$$ (5-13)

obtained via the “orthogonality relation”

$$\sum_{n=0}^{N-1} e^{2\pi i n(k-j)/N} = N \delta_{kj}, \qquad 0 \le k, j \le N-1.$$ (5-14)

Eq. (5-14) is nothing but the statement that the $N$th roots of unity form a closed regular
polygon in the complex plane. Then the discrete power spectrum $P_D$ is defined

$$P_D(\nu_k) = |\hat{x}_k|^2, \qquad k = 0, 1, 2, \ldots, N-1.$$ (5-15)

If we want to study a slice of a dynamics (typically in the asymptotic regime) by the discrete
Fourier transform, we choose $T$ and $\Delta t$ such that the important system frequencies are
contained in the interval $(1/T, (N-1)/T) \approx (1/T, 1/\Delta t)$. Incidentally, the fast Fourier
transform, or FFT (see Berge et al. 1984, section III.4), greatly shortens the numerical
evaluation of (5-12).
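The transform pair, the orthogonality relation and the power spectrum can be written out directly. The sketch below is a plain $O(N^2)$ evaluation, not an FFT. It assumes the symmetric $1/\sqrt{N}$ normalization and a minus sign in the exponent of the forward transform, which is one common convention and may differ from other references; all function names are hypothetical.

```python
import cmath
import math


def dft(x):
    """Forward transform: xhat_k = N**-0.5 * sum_j x_j * exp(-2*pi*i*j*k/N)."""
    n = len(x)
    return [sum(x[j] * cmath.exp(-2j * math.pi * j * k / n) for j in range(n))
            / math.sqrt(n) for k in range(n)]


def inverse_dft(xhat):
    """Inverse transform: x_j = N**-0.5 * sum_k xhat_k * exp(+2*pi*i*j*k/N)."""
    n = len(xhat)
    return [sum(xhat[k] * cmath.exp(2j * math.pi * j * k / n) for k in range(n))
            / math.sqrt(n) for j in range(n)]


def power_spectrum(x):
    """Discrete power spectrum |xhat_k|**2 for k = 0, ..., N-1."""
    return [abs(c) ** 2 for c in dft(x)]


def roots_of_unity_sum(k, j, n):
    """Left-hand side of the orthogonality relation: N if k == j, else 0."""
    return sum(cmath.exp(2j * math.pi * m * (k - j) / n) for m in range(n))


if __name__ == "__main__":
    samples = [math.sin(0.3 * j) + 0.1 * j for j in range(32)]
    recovered = inverse_dft(dft(samples))
    print(max(abs(r - s) for r, s in zip(recovered, samples)))
    print(abs(roots_of_unity_sum(3, 3, 16)), abs(roots_of_unity_sum(3, 5, 16)))
```

With this normalization the transform is unitary, so the sum of the power spectrum equals the sum of the squared samples.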

Several  features  that  distinguish  the  discrete  Fourier  transform  and  its  power  spectrum 
from  the  usual  (continuous)  Fourier  transform  and  its  power  spectrum  should  be  noted. 

a. If the set $\{x_j\}$ is a slice of a stable periodic orbit of period $\tau \ll T$, then the
discrete power spectrum $P_D(\nu_k)$ has peaks at all harmonics of the fundamental frequency
$1/\tau$ less than $\nu_{max} \approx 1/\Delta t$. For example, for a stable $n$-cycle of a map, $P_D$ has peaks at
the harmonics $m/n$, $0 \le m \le n-1$, of the fundamental frequency $1/n$. This is a direct
reflection of the fact that the Fourier series of a process restricted to time interval $(0, \tau)$
or of a process periodic with period $\tau$ has nonzero Fourier amplitudes in general at all
harmonics of the fundamental frequency $1/\tau$.

b. The peaks in $P_D(\nu_k)$ at the dominant frequencies are finitely high and broad
and are asymmetrical. The power spectrum does not vanish between these peaks. It is
symmetric around $\nu_{max}/2$.

c. Contrast the behavior of the discrete power spectrum $P_D$ and of the usual
continuous power spectrum $P$ developed for stationary ergodic random processes (Reif,
1965, Secs. 15.13 - 15.15). For the quasi-periodic motion (4-16), $P(\nu)$ is a sum of delta
functions at the frequencies $\pm\nu_i$, $i = 1, 2, \ldots, n$. These are infinitely high and infinitely
narrow peaks, and $P(\nu) = 0$ elsewhere. $P_D(\nu_k)$, on the other hand, is peaked not only at
the $\nu_i$ but also at all their harmonics in the sense of remark (a). These peaks are finitely
high and broad, in accordance with remark (b). Call these spikes.
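Remark (a) can be checked on a concrete cycle. The sketch below assumes the logistic map in the form $x \to \mu x(1-x)$ at $\mu = 3.5$, where the attractor is a stable 4-cycle, and takes $N = 64$ samples after a transient. Because $N$ is here an exact multiple of the period, each peak occupies a single bin; for a period that does not divide $N$ the peaks broaden as in remark (b). The function names and the relative threshold are hypothetical choices.

```python
import cmath
import math


def logistic_orbit(mu, x0, n, n_transient=2000):
    """Return n successive iterates of x -> mu*x*(1-x) after a transient."""
    x = x0
    for _ in range(n_transient):
        x = mu * x * (1.0 - x)
    orbit = []
    for _ in range(n):
        orbit.append(x)
        x = mu * x * (1.0 - x)
    return orbit


def power_spectrum(x):
    """|xhat_k|**2 with xhat_k = N**-0.5 * sum_j x_j * exp(-2*pi*i*j*k/N)."""
    n = len(x)
    return [abs(sum(x[j] * cmath.exp(-2j * math.pi * j * k / n)
                    for j in range(n))) ** 2 / n
            for k in range(n)]


def peak_indices(power, rel_threshold=1e-9):
    """Indices k whose power exceeds rel_threshold times the largest value."""
    top = max(power)
    return [k for k, p in enumerate(power) if p > rel_threshold * top]


if __name__ == "__main__":
    orbit = logistic_orbit(3.5, 0.3, 64)
    # Frequencies k/N of the bins that carry power: harmonics m/4 of 1/4.
    print([k / 64 for k in peak_indices(power_spectrum(orbit))])
```

The bins carrying power sit at $k/N = 0, 1/4, 1/2, 3/4$, that is, at the harmonics $m/n$ with $n = 4$.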

The discrete power spectrum is a valuable tool in nonlinear dynamics. Regular motion
shows up as spikes. A broad band power spectrum $P_D(\nu_k)$, a more-or-less continuous
background with a few spikes over it, is accepted as a “signature of chaos.”

Further, a discrete self-correlation

$$C_D(m) = \frac{1}{N} \sum_{j=0}^{N-1} x_j x_{j+m}$$ (5-16)

can be defined. $C_D(m)$ decaying in $m$ corresponds to a broad band power spectrum.
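A discrete self-correlation of this kind is simple to evaluate. The sketch below makes two assumptions that the text does not fix: the sum is truncated at the end of the record (terms with $j + m \ge N$ are dropped), and the mean is subtracted before a normalized value is formed, so that a constant offset does not mask the decay. The function names are hypothetical, and the logistic map $x \to \mu x(1-x)$ is used only as a test signal.

```python
def self_correlation(x, m):
    """C(m) = (1/N) * sum_j x_j * x_{j+m}, keeping only the j with j + m < N."""
    n = len(x)
    return sum(x[j] * x[j + m] for j in range(n - m)) / n


def normalized_correlation(x, m):
    """Self-correlation of the mean-subtracted series divided by its lag-0 value."""
    mean = sum(x) / len(x)
    y = [v - mean for v in x]
    return self_correlation(y, m) / self_correlation(y, 0)


def logistic_orbit(mu, x0, n):
    """Return n successive iterates of x -> mu*x*(1-x) starting from x0."""
    orbit, x = [], x0
    for _ in range(n):
        orbit.append(x)
        x = mu * x * (1.0 - x)
    return orbit


if __name__ == "__main__":
    chaotic = logistic_orbit(4.0, 0.3, 20000)
    periodic = [0.2, 0.8] * 500
    for lag in (1, 2, 3):
        print(lag, normalized_correlation(chaotic, lag),
              normalized_correlation(periodic, lag))
```

For the period-2 signal the normalized correlation alternates near $-1$ and $+1$ without decaying, while for the orbit at $\mu = 4$ it is already close to zero at lag 1.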

Prob. 5.2 Compute the discrete power spectrum for the logistic map (3-9) and
display it on a graph for the values of $\mu$ given in Prob. 3.1. Note the different behavior
for $\mu < \mu_\infty$ and $\mu > \mu_\infty \approx 3.5699\ldots$.

5.4 Algorithmic Complexity, Kolmogorov Entropy, Etc.

There  are  other  measures  of  “chaos,”  which  we  shall  treat  much  more  briefly  than  the 
preceding  ones  in  this  section.  The  interested  reader  can  pursue  these  topics,  lightly 
touched  on  here,  by  going  to  the  cited  literature. 

5.4.1  Algorithmic  Complexity 

This  is  a  measure  of  the  randomness  of  the  orbits  of  a  dynamics.  See  Alekseev  and 
Yacobson  (1981)  for  details  and  proofs.  Some  workers,  especially  Joseph  Ford  and  his 
collaborators,  equate  randomness  with  chaos.  An  informal  sketch  follows. 

A finite word $s_N$ is a string of $N$ letters from some alphabet $\mathcal{L}$, for example $\mathcal{L} = \{1, 2, \cdots, m\}$. A special case is $\mathcal{L} = \{0, 1\}$, the binary alphabet. Then

Def. The algorithmic complexity $K(s_N)$ of the finite word $s_N$ is the length (number of symbols) of the shortest algorithm which will print the word. (5-17)

This definition depends (apparently) on the Turing machine, the algorithm, but can be made machine-independent. Algorithms are also called “programs,” “recursive relations,” or “rules.” We have $1 \le K(s_N) \le N$ in any case. If, for large $N$, $K(s_N)$ is not appreciably shorter than $N$ itself, $K(s_N) = O(N)$ as $N \to \infty$, the word is called random.

Define the algorithmic complexity (AC for short hereafter) of an infinite word $s_\infty$ as

$$\mathcal{K}(s_\infty) = \lim_{N \to \infty} K(s_N)/N, \tag{5-18}$$

where $s_N$ is the first $N$ letters of $s_\infty$. Then if $\mathcal{K}(s_\infty) > 0$, the infinite word is called random.

Next, consider a map dynamics $F : X \to X$, where the phase space $X$ may be quite general. $X$ is partitioned into a finite number of disjoint sets $\{E_1, E_2, \cdots, E_m\} = \xi$ (a coarse graining of $X$). Then the actual orbit $F^n(x)$, $n > 0$, is replaced by its itinerary: the infinite word $\omega_1, \omega_2, \omega_3, \cdots$ according as the nth point falls into set $E_{\omega_n}$: $F^n(x) \in E_{\omega_n}$. We then define the AC of this infinite word as in (5-18). Since the itinerary depends on the initial point $x$, the partition $\xi$, and the map $F$, denote this AC by $\mathcal{K}(x, F|\xi)$:

$$\mathcal{K}(x, F|\xi) = \lim_{N \to \infty} K(\omega_N)/N, \tag{5-19}$$

where $\omega_N$ is the finite m-ary sequence $\omega_1, \omega_2, \cdots, \omega_N$. Now let $\mathcal{K}(x, F)$ be the least upper bound (or sup) of (5-19) over all partitions $\xi$. Then we say that the orbit based at $x$ is random if $\mathcal{K}(x, F) > 0$.

We can see the motivating idea behind this formalism. If the orbit is really “random” in the intuitive sense, there is no short algorithm available to print it out. We must simply numerically calculate the whole orbit. This makes $\mathcal{K}(x, F) > 0$.

5.4.2 Kolmogorov and Other Entropies

This is an idea closely related to algorithmic complexity, whose aim is to quantify randomness in a dynamics. Again one works with maps $F$ on very general spaces. The entropy $h(F)$ measures the uncertainty in predicting the $(N + 1)$th point of an itinerary, given the first $N$ points, for $N \gg 1$. Again one uses a partition $\xi$ of phase space, and replaces an orbit by the labels of the coarse-graining sets that it visits. Kolmogorov (or metric) entropy uses measure theory and a generic orbit. The topological entropy is defined slightly differently, following individual orbits of $F$. See Alekseev and Yacobson (1981) and Young (1983). The upshot is that the dynamics is called random or unpredictable if the entropy $h(F) > 0$.
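The idea of replacing an orbit by its itinerary and asking how compactly that word can be described can be illustrated with a rough sketch. True algorithmic complexity is not computable, so the example below substitutes a general-purpose compressor (zlib) and uses the compressed length per symbol only as a crude upper-bound proxy. The logistic map $x \mapsto \mu x(1 - x)$, the two-set partition at $x = 1/2$, and the parameter values are assumptions made here for illustration.

```python
import zlib

def itinerary(mu, n=4000, x0=0.3, transient=1000):
    """Binary itinerary of the logistic map for the partition {[0, 1/2), [1/2, 1]}."""
    x = x0
    for _ in range(transient):
        x = mu * x * (1.0 - x)
    symbols = []
    for _ in range(n):
        symbols.append("1" if x >= 0.5 else "0")
        x = mu * x * (1.0 - x)
    return "".join(symbols)

def compressed_bits_per_symbol(word):
    """Length of the zlib-compressed word in bits, divided by the word length."""
    packed = zlib.compress(word.encode("ascii"), 9)
    return 8.0 * len(packed) / len(word)

periodic_rate = compressed_bits_per_symbol(itinerary(3.2))   # stable 2-cycle
chaotic_rate = compressed_bits_per_symbol(itinerary(3.9))    # chaotic regime
print("bits per symbol, mu = 3.2:", periodic_rate)
print("bits per symbol, mu = 3.9:", chaotic_rate)
```

The periodic itinerary compresses to a small, essentially length-independent size, so its rate tends to zero as the word grows; the chaotic itinerary keeps a rate bounded away from zero, in the spirit of the criteria $\mathcal{K} > 0$ and $h(F) > 0$.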

To my knowledge, the precise interrelationships of sensitive dependence, algorithmic complexity, and the various entropies have never been clarified. There are scattered cross-connections like the Pesin formula (see Young, 1983, p. 604). The ideas of unpredictability, randomness, and chaos certainly overlap; they cannot be independent. For instance, one could ask: does a dynamics which shows SD in the minimal sense (5-2) necessarily have positive algorithmic complexity and entropy? My conjecture is “Yes.”

6. RENORMALIZATION GROUP

6.1  Introduction 

There are phenomena in mathematics, physics, and other sciences which occur repetitively on all length scales. The geometry of such invariant limit sets is called self-similar (precisely, the part is topologically equivalent to the whole under a scale change $x \mapsto ax$, $a$ = const.). We already saw an example of this in the period-doubling process in the logistic map (section 3.10.2). An all-important example is the phenomenon of phase transitions in condensed matter. At the critical point the interactions are self-similar under arbitrary changes of scale (“coarse-grainings”) because the correlation length is strictly infinite at the critical point. The clearest explanation of this in nontechnical language that I know is Bruce and Wallace (1989). This directly leads to an algorithm for computing the famous critical exponents of thermodynamics, for example, the exponent $\alpha$ in the heat capacity $C$ near the critical temperature $T_c$,

$$C \propto |T - T_c|^{-\alpha} \quad \text{for } |T - T_c| \text{ small}, \tag{6-1}$$

which has a typical value $\alpha \approx 0.125$.

This algorithm, known as the renormalization group (RG), exploits the self-similarity by defining a transformation, or map, $R$ of coupling constants. If $R$ is iterated, the system is taken into the asymptotic, noncritical regime of weak coupling constants, where known analytic results or perturbation theory give reliable results. Iterating backward, we recover desired thermodynamic variables such as a free energy in the realistic coupling range. The RG can thus in principle deliver exact results inaccessible to perturbation theory. The system at the critical point $x_c$ itself is not changed under the iteration because of the self-similarity there: $R(x_c) = x_c$, that is, $x_c$ is a fixed point of the RG coupling constant map $R$. This ties phase transitions to nonlinear dynamics, its concepts and its techniques. In particular, linear stability analysis at $x^* = x_c$ gives the critical exponents. For a simple treatment of the mathematics, see Maris and Kadanoff (1978) or Chandler (1987, Secs. 5.6, 5.7); for the more advanced and complete theory, see Plischke and Bergersen (1989, chap. 6).

6.2  RG  for  Ising  Lattices 

Ising lattices are first of all lattices: regular $n$D repeating patterns of discrete points. Second, there are spins $s_i$, $i = 1, 2, 3, \cdots, N$, at the $N$ lattice sites, each having only two values “up” or “down”: $s_i = \pm 1$, all $i$. A particular configuration $\{s_1, s_2, \cdots, s_N\} = \nu$, where each $s_i = +1$ or $-1$, is a (microscopic) state of the system. Thus there are $2^N$ states. The energy is due to interactions between nearest neighbor spins in the lattice; the spins can also interact with an external magnetic field $H$ via their magnetic moments $\mu$. Thus we can write for the energy $E_\nu$ of the state $\nu$

$$E_\nu = -J \sum_{i,j}{}' s_i s_j - \mu H \sum_i s_i, \tag{6-2}$$

where the prime on $\sum$ restricts the sum to nearest neighbor pairs. $J > 0$ corresponds to a model of a ferromagnet, but Ising lattices model very many systems, even “lattice gases,” under a slight reinterpretation.

The basic problem in equilibrium statistical mechanics is to evaluate the partition function $Z$,

$$Z = \sum_\nu e^{-\beta E_\nu}, \quad \beta = 1/kT, \tag{6-3}$$

where $T$ = absolute temperature and $k$ = Boltzmann’s constant. The sum is over all states $\nu$. (The grand partition function will not enter here.) From $Z$ the entire thermodynamics follows!

For dimension $n = 1$, (6-2) can be written

$$E_\nu = -J \sum_{i=1}^{N} s_i s_{i+1} - \mu H \sum_{i=1}^{N} s_i \quad (s_{N+1} = s_1), \tag{6-4}$$

where for convenience the 1D lattice (“chain”) was closed by imposing periodic boundary conditions. Then

$$Z(K, h, N) = \sum_{\{s_i\}} \exp\Big[\sum_{i=1}^{N} (K s_i s_{i+1} + h s_i)\Big], \tag{6-5}$$

where $K = \beta J$, $h = \beta \mu H$, and the sum is over all $2^N$ states $\{s_i\}$. The sum can actually be done in closed form (!) (see Plischke and Bergersen, 1989, section 3.F).

Prob. 6.1 Evaluate (6-5). This is mainly to get you thinking about such large sums and to see how nontrivial even the simple-looking sum in (6-5) is! For the transfer matrix method, see Plischke and Bergersen, or Chandler (1987, Ex. 5.21).

For $h = 0$ the answer is especially simple,

$$Z(K, N) = (2 \cosh K)^N. \tag{6-6}$$

(A clarification: this is the answer in the so-called thermodynamic limit $N \to \infty$, that is, the leading term.) Then it turns out that the 1D Ising lattice has no (proper) phase transition: the order parameter $m = \langle s_i \rangle$, any $i$, the mean value of the spin for $h = 0$, stays zero for all $T > 0$.
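For a short closed chain the sum over all $2^N$ states can be carried out by brute force and compared with the closed form. The sketch below assumes $h = 0$ and periodic boundary conditions; it uses the standard transfer-matrix result that the $2 \times 2$ transfer matrix has eigenvalues $2\cosh K$ and $2\sinh K$, so that $Z = (2\cosh K)^N + (2\sinh K)^N$, of which (6-6) is the leading term. The values $K = 0.5$ and $N = 12$ are arbitrary illustrative choices.

```python
import itertools
import math

def partition_function_brute_force(K, N):
    """Sum exp(K * sum_i s_i s_{i+1}) over all 2^N states of a closed chain (h = 0)."""
    total = 0.0
    for spins in itertools.product((-1, 1), repeat=N):
        bonds = sum(spins[i] * spins[(i + 1) % N] for i in range(N))
        total += math.exp(K * bonds)
    return total

def partition_function_transfer_matrix(K, N):
    """Trace of the N-th power of the transfer matrix: (2 cosh K)^N + (2 sinh K)^N."""
    return (2.0 * math.cosh(K)) ** N + (2.0 * math.sinh(K)) ** N

K, N = 0.5, 12
z_sum = partition_function_brute_force(K, N)
z_tm = partition_function_transfer_matrix(K, N)
z_leading = (2.0 * math.cosh(K)) ** N          # formula (6-6)
print("brute force     :", z_sum)
print("transfer matrix :", z_tm)
print("leading term    :", z_leading)
print("relative size of the correction:", (z_sum - z_leading) / z_sum)
```

The correction to (6-6) is of relative size $(\tanh K)^N$, which vanishes as $N \to \infty$ for any finite $K$.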

6.2.1 1D Ising Lattice by the RG

Now we wish to derive (6-6) by a recursive scheme, the RG, and verify that there is no phase transition. We follow Maris and Kadanoff (1978) and Chandler (1987).

Write out the sum (6-5) with $h = 0$ and group the factors in a certain way:

$$Z(K, N) = \sum_{s_1, s_2, \cdots, s_N} \exp[K(s_1 s_2 + s_2 s_3 + s_3 s_4 + \cdots)] = \sum_{s_1, s_2, \cdots, s_N} \exp[K(s_1 s_2 + s_2 s_3)] \exp[K(s_3 s_4 + s_4 s_5)] \cdots.$$

Sum first over even-numbered spins $2, 4, 6, \cdots$:

$$Z(K, N) = \sum_{s_1, s_3, s_5, \cdots} \{\exp[K(s_1 + s_3)] + \exp[-K(s_1 + s_3)]\} \times \{\exp[K(s_3 + s_5)] + \exp[-K(s_3 + s_5)]\} \times \cdots. \tag{6-7}$$

Every other degree of freedom has been removed; the spins have been “thinned out” (see Fig. 6.1).


o   o   o   o   o  ···   →   o       o  ···
1   2   3   4   5            1       3

Fig. 6.1 “Thinned out” Ising chain (see text)

Now we try to make (6-7) look like the original sum for $N/2$ lattice sites with a possibly different coupling constant $K'$. If this is possible, a recursion relation can be developed. So set

$$e^{K(s + s')} + e^{-K(s + s')} = f(K) e^{K' s s'}, \quad \text{all } s, s' = \pm 1. \tag{6-8}$$

If this were possible, then from (6-7) and (6-8) we would get

$$Z(K, N) = \sum_{s_1, s_3, s_5, \cdots} f(K) e^{K' s_1 s_3} \times f(K) e^{K' s_3 s_5} \times \cdots = [f(K)]^{N/2} Z(K', N/2), \tag{6-9}$$

with the same function $Z$! Returning to (6-8), we see that there are only two different conditions:

$$\begin{aligned} s = s' = \pm 1: &\quad e^{2K} + e^{-2K} = f(K) e^{K'}, \\ s = -s' = \pm 1: &\quad 2 = f(K) e^{-K'}. \end{aligned} \tag{6-10}$$

These can be solved, and we get

$$K' = \tfrac{1}{2} \ln \cosh 2K, \quad f(K) = 2 (\cosh 2K)^{1/2}. \tag{A}$$

Now by extensivity we can set $\ln Z(K, N) = N g(K)$. Since the (Helmholtz) free energy $G = -kT \ln Z$ in general, we see that $g = -\beta G/N$ is, apart from the factor $-\beta$, the free energy per spin. We call $g$ “the free energy” from now on. The natural logarithm of (6-9) can then be rewritten $g(K) = (1/2) \ln f(K) + (1/2) g(K')$, or, with (A), as

$$g(K') = 2 g(K) - \ln[2 (\cosh 2K)^{1/2}]. \tag{B}$$

Equations (A) and (B) are called the RG equations of the 1D Ising lattice. By noting $f(K) = 2 \exp K'$ from (A), the inverse transformation can be written

$$K = \tfrac{1}{2} \cosh^{-1}(e^{2K'}), \tag{C}$$

$$g(K) = \tfrac{1}{2} g(K') + \tfrac{1}{2} \ln 2 + \tfrac{1}{2} K'. \tag{D}$$

We note immediately that $K' < K$ always.

The first application we leave as a problem.

Prob. 6.2 Apply (C) and (D) starting with small $K'$ and iterate to get $g(K)$ for “realistic” values of $K \approx 2.7$. Take say $K' = 0.01$, for which you can use $Z(K', N) \approx Z(0, N) = 2^N$ from the exact result (6-6), and thus $g(K') \approx \ln 2$. Compare $g(K)$ at each step with the exact value derived from (6-6). Notice how fast the iteration converges to the exact values.

On the other hand, we could start with large $K$, taking $Z(K, N) \approx e^{NK}$ and $g(K) \approx K$ from (6-6), and iterate (A) and (B) to try to calculate $g$ for medium $K$. But this does not work; the errors grow exponentially!
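The backward iteration just described can be sketched in a few lines. The example assumes the inverse RG equations in the form $K = \tfrac{1}{2}\cosh^{-1}(e^{2K'})$ and $g(K) = \tfrac{1}{2} g(K') + \tfrac{1}{2}\ln 2 + \tfrac{1}{2} K'$, starts from $K' = 0.01$ with $g(K') \approx \ln 2$, and compares each step with $g = \ln(2\cosh K)$ from (6-6). The stopping value $K \ge 2.5$ and the function names are choices made here for illustration.

```python
import math

def rg_inverse_step(K_prime, g_prime):
    """One application of (C) and (D): from (K', g(K')) back to (K, g(K))."""
    K = 0.5 * math.acosh(math.exp(2.0 * K_prime))
    g = 0.5 * g_prime + 0.5 * math.log(2.0) + 0.5 * K_prime
    return K, g

def g_exact(K):
    """Free energy per spin from (6-6): g = ln(2 cosh K)."""
    return math.log(2.0 * math.cosh(K))

K, g = 0.01, math.log(2.0)          # weak-coupling start, g(K') ~ ln 2
history = [(K, g, g_exact(K))]
while K < 2.5:
    K, g = rg_inverse_step(K, g)
    history.append((K, g, g_exact(K)))

for K_i, g_rg, g_ex in history:
    print(f"K = {K_i:.6f}   g_RG = {g_rg:.6f}   g_exact = {g_ex:.6f}")
```

Each backward step halves the error carried in $g$, which is why the iteration converges so quickly; the forward map (B) doubles it instead, which is the exponential error growth noted above.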

Second application. Examine the “flow” (used here for a map orbit) of the coupling constant under (A) and (C). See Fig. 6.2.

$K^* = 0, \infty$ are the only fixed points of (A) or its inverse (C). Point 0 is stable, point $\infty$ is unstable under the map (A): $K' = R(K) = (1/2) \ln \cosh 2K$, as you can check. What this means physically is as follows: on thinning out the spins (going to a larger length scale = lattice spacing), we go to a system with a weaker coupling constant $K' < K$. This is the meaning of the flow in Fig. 6.2. Only the points $K = 0$ and $\infty$ are independent of this change of scale, since $K^* = 0$ ($T = \infty$) is a completely disordered, and $K^* = \infty$ ($T = 0$) a completely ordered, regime. A proper phase transition would correspond to a finite $K^*$ ($\ne 0$ or $\infty$) also independent of a change of scale, thus a fixed point $K^*$ of $R$ other than 0 or $\infty$.

The absence of such a finite fixed point in addition to the trivial fixed points $K^* = 0, \infty$ of the map $R$ means that the Ising chain shows no phase transition.


One can define a correlation length $\xi$ by

$$\langle s_i s_{i+j} \rangle = e^{-j/\xi}, \quad j = 1, 2, \cdots, N. \tag{6-11}$$

The exact Ising chain theory gives $\xi = -(\ln \tanh K)^{-1}$. Thus $\xi = 0$ at $K^* = 0$ and $\xi = +\infty$ at $K^* = \infty$, so the system is completely uncorrelated at $T = \infty$ and perfectly correlated at $T = 0$. This makes physical sense.

N.B. $\xi = \infty$ is taken as the definition of critical point. In this sense the Ising chain is sometimes said to have a phase transition at $T_c = 0$.

6.2.2 2D Ising Lattice by the RG

For $n = 2$ we get, for $h = 0$,

$$Z = \sum_{\{s_i\}} \exp\Big[K \sum_{i,j}{}' s_i s_j\Big], \tag{6-12}$$

where $K = \beta J$ again and the spins are now at the sites of a 2D square lattice, let us say. Already the evaluation of the sum (6-12) is so horrendously difficult that its exact evaluation in 1944 made L. Onsager famous. The solution answered many deep and worrisome questions. Chief result: there is a phase transition at critical temperature $\beta_c = 1/kT_c = 0.44069/J$. Further, a whole set of physically reasonable thermodynamical critical exponents was obtained. See Plischke and Bergersen (1989, section 5.A) for details.

Now we try to apply the RG method to (6-12). We proceed just as in section 6.2.1, so we can be briefer. Write out (6-12) and group the factors into sets, each of which contains just one spin from every other diagonal, spins $\cdots 5, 6, \cdots$ (see Fig. 6.3). Then sum over spins $\cdots 5, 6, \cdots$:

$$Z(K, N) = \sum_{\{s_i\}} \cdots \exp[K s_5 (s_1 + s_2 + s_3 + s_4)] \exp[K s_6 (s_2 + s_3 + s_7 + s_8)] \cdots$$

$$= \sum_{\text{remaining } s_i} \{\exp[K(s_1 + s_2 + s_3 + s_4)] + \exp[-K(s_1 + s_2 + s_3 + s_4)]\} \times \{\exp[K(s_2 + s_3 + s_7 + s_8)] + \exp[-K(s_2 + s_3 + s_7 + s_8)]\} \times \cdots. \tag{6-13}$$

But now we cannot set the first factor $\{\cdots\}$ in (6-13) equal to $f(K) \exp[K'(s_1 s_2 + s_2 s_3 + s_3 s_4 + s_4 s_1)]$ (that is, nearest neighbors in the thinned out lattice) for all $s_i = \pm 1$, $i = 1, 2, 3, 4$, since this gives four independent equations, and we have only $f(K)$ and $K'$ to choose. So we impose

$$\exp[K(s_1 + s_2 + s_3 + s_4)] + \exp[-K(s_1 + s_2 + s_3 + s_4)] = f(K) \exp\big[\tfrac{1}{2} K_1 \underbrace{(s_1 s_2 + s_2 s_3 + s_3 s_4 + s_4 s_1)}_{\text{nn}} + K_2 \underbrace{(s_1 s_3 + s_2 s_4)}_{\text{nnn}} + K_3 \underbrace{s_1 s_2 s_3 s_4}_{\text{square}}\big], \tag{6-14}$$

where we have labeled the terms nn for nearest neighbors, nnn for next nearest neighbors, and square for the four spins around a square in the coarse-grained lattice, see Fig. 6.3. Then the four independent possibilities: all $s_i$ equal, three $s_i$ equal and one unequal, $s_1 = s_2 = -s_3 = -s_4$, and $s_1 = -s_2 = s_3 = -s_4$ ($i = 1, 2, 3, 4$) give the four equations

$$\begin{aligned} \exp 4K + \exp(-4K) &= f(K) \exp(2K_1 + 2K_2 + K_3), \\ \exp 2K + \exp(-2K) &= f(K) \exp(-K_3), \\ 2 &= f(K) \exp(-2K_2 + K_3), \\ 2 &= f(K) \exp(-2K_1 + 2K_2 + K_3). \end{aligned} \tag{6-15}$$

The solution is

$$K_1 = \tfrac{1}{4} \ln \cosh 4K, \quad K_2 = \tfrac{1}{8} \ln \cosh 4K, \quad K_3 = \tfrac{1}{8} \ln \cosh 4K - \tfrac{1}{2} \ln \cosh 2K,$$

$$f(K) = 2 (\cosh 2K)^{1/2} (\cosh 4K)^{1/8}. \tag{6-16}$$
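That four couplings suffice for all sixteen configurations of the four corner spins can be checked by direct enumeration. The sketch below assumes the nearest-neighbor term enters each square with weight $K_1/2$ (each nn bond is shared by two squares), with $K_1 = \tfrac{1}{4}\ln\cosh 4K$, $K_2 = \tfrac{1}{8}\ln\cosh 4K$, $K_3 = \tfrac{1}{8}\ln\cosh 4K - \tfrac{1}{2}\ln\cosh 2K$, and $f(K) = 2(\cosh 2K)^{1/2}(\cosh 4K)^{1/8}$; the function names are illustrative.

```python
import itertools
import math

def couplings(K):
    """Couplings K1, K2, K3 and prefactor f(K) as given in (6-16)."""
    c2, c4 = math.cosh(2.0 * K), math.cosh(4.0 * K)
    K1 = 0.25 * math.log(c4)
    K2 = 0.125 * math.log(c4)
    K3 = 0.125 * math.log(c4) - 0.5 * math.log(c2)
    f = 2.0 * math.sqrt(c2) * c4 ** 0.125
    return K1, K2, K3, f

def max_mismatch(K):
    """Largest relative difference between the two sides of (6-14) over all 16 configurations."""
    K1, K2, K3, f = couplings(K)
    worst = 0.0
    for s1, s2, s3, s4 in itertools.product((-1, 1), repeat=4):
        total = s1 + s2 + s3 + s4
        lhs = math.exp(K * total) + math.exp(-K * total)
        nn = s1 * s2 + s2 * s3 + s3 * s4 + s4 * s1
        nnn = s1 * s3 + s2 * s4
        square = s1 * s2 * s3 * s4
        rhs = f * math.exp(0.5 * K1 * nn + K2 * nnn + K3 * square)
        worst = max(worst, abs(lhs - rhs) / lhs)
    return worst

for K in (0.1, 0.44069, 1.0):
    print(f"K = {K}: max relative mismatch = {max_mismatch(K):.2e}")
```

The mismatch is at the level of floating-point rounding for every $K$ tried, so the decimation of one spin is exact once the next-nearest-neighbor and four-spin couplings are admitted.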

Now substitute (6-14) for each factor $\{\cdots\}$ of (6-13). A factor $f(K)^{N/2}$ comes outside. Notice each nn pair appears twice, while each nnn pair and set of four spins in a square appears once. Hence

$$Z(K, N) = [f(K)]^{N/2} \sum_{N/2 \text{ spins}} \exp\Big[K_1 \sum{}' s_i s_j + K_2 \sum{}'' s_l s_m + K_3 \sum{}''' s_p s_q s_r s_t\Big]. \tag{6-17}$$

Sum $\sum'$ is over all nn pairs, sum $\sum''$ is over nnn pairs, and sum $\sum'''$ is over sets of four spins in a square in the coarse-grained lattice.

The  moral  is  this:  due  to  the  greater  connectivity  of  the  2D  lattice,  thinning  out  the  spins 
by  summing  those  on  every  other  diagonal  results  in  a  more  complicated  interaction  (6-17) 
than  the  original  one  (6-12).  This  happens  generally  in  all  dimensions  >  1.  Thus  an  exact 
RG  calculation  of  Z  fails. 

One  can  try  several  approximations  to  salvage  the  method. 

1) Neglect $K_2$, $K_3$ entirely. Then

$$Z(K, N) \approx [f(K)]^{N/2} Z(K_1, N/2), \quad K_1 = \tfrac{1}{4} \ln \cosh 4K. \tag{6-18}$$

But this is equivalent to the 1D RG analysis (just set $2K = \tilde{K}$, $2K_1 = \tilde{K}'$, $Z(K, N) = \tilde{Z}(\tilde{K}, N)$, and compare (6-18) with (6-9) and (A) written with tildes on $K$, $K'$, and $Z$). Hence there is no phase transition. Approximation (1) is too rough.

2) Neglect $K_3$ and approximate

$$K_1 \sum{}' s_i s_j + K_2 \sum{}'' s_l s_m \approx K'(K_1, K_2) \sum{}' s_i s_j \tag{6-19}$$

for some $K'(K_1, K_2)$. This gives (6-18) with this $K'$ substituted for $K_1$. Introduce the free energy $g(K)$ as before; then the ln of this last equation gives

$$g(K') = 2 g(K) - \ln\{2 [\cosh 2K]^{1/2} [\cosh 4K]^{1/8}\}, \tag{B}$$

where the last equation of (6-16) was used to substitute for $f(K)$. Now (6-19) cannot, of course, be satisfied for all spin values. We want to estimate $K'$ by satisfying it when all spins are aligned. Since there are $N$ nn pairs and $N$ nnn pairs in a 2D square lattice of $N/2$ sites, we have in the aligned case

$$K_1 \sum{}' s_i s_j = N K_1, \quad K_2 \sum{}'' s_l s_m = N K_2;$$

thus $K_1 + K_2 = K'$ from (6-19). Then from (6-16),

$$K' = \tfrac{3}{8} \ln \cosh 4K. \tag{A}$$

To  summarize:  (A)  and  (B)  are  our  approximate  RG  equations  for  the  2D  Ising  lattice. 
The  inverses  can  be  written  down,  but  we  skip  them. 

Now the map (A) has trivial fixed points at $K^* = 0, \infty$ as before. But it has the nontrivial fixed point $K^* = K_c$,

$$K_c = \tfrac{3}{8} \ln \cosh 4K_c \implies K_c \approx 0.50698. \tag{6-20}$$

But now $K^* = 0$ and $\infty$ are stable, while $K_c$ is unstable. See the phase portrait of the “flow” in Fig. 6.4.


Fig. 6.4 Phase portrait of the 1D map (A), showing the trivial stable fixed points and the nontrivial unstable critical fixed point

The physical meaning of this parallels the discussion at the end of section 6.2.1. Going to a larger length scale now drives systems with coupling weaker than $K_c$ to the completely disordered regime $K^* = 0$, but systems with coupling stronger than $K_c$ are driven to the completely ordered regime $K^* = \infty$. But systems with $K = K_c$ exactly are independent of a change of scale, have correlation length $\xi = \infty$, and correspond to a proper phase transition. According to (6-20), this critical temperature $T_c$ is given by

$$J/kT_c \approx 0.50698; \tag{6-21}$$

compare the exact value 0.44069 from Onsager’s solution.


Finally let us see how critical exponents are predicted on the RG approach. We assume that the free energy $g(K)$ has a nonanalytic part which $\approx a(K - K_c)^{2-\alpha}$ near $K_c$. Then from (B) above and

$$K' - K_c \approx (dK'/dK)\big|_{K_c} (K - K_c)$$

we get

$$a (dK'/dK)^{2-\alpha} (K - K_c)^{2-\alpha} = 2a (K - K_c)^{2-\alpha} + \text{terms regular at } K = K_c, \tag{6-22}$$

where $dK'/dK$ is understood to be evaluated at $K_c$. But it follows from the theory of analytic functions that the nonanalytic terms in (6-22) must be equal. Cancel $a$ (assuming $a \ne 0$) and the factor $(K - K_c)^{2-\alpha}$, take ln, and get

$$(2 - \alpha) \ln(dK'/dK) = \ln 2 \implies \alpha = 2 - \ln 2 / \ln(dK'/dK) \approx 0.131. \tag{6-23}$$

$dK'/dK$ from (A) was evaluated at $K_c$ from (6-20). This $\alpha$ is in fact the heat capacity critical exponent (6-1) since the relation of $C$ to $g(K)$ can be checked to be

$$C/N = k K^2 \, d^2 g(K)/dK^2,$$

which makes $C \sim |K - K_c|^{-\alpha} \sim |T - T_c|^{-\alpha}$ near $K_c$. Further, this is a continuous, or second order, phase transition, since $g$ and $dg/dK$, but not $d^2 g/dK^2$, are continuous at $K = K_c$.
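The numbers quoted for the approximate 2D analysis can be reproduced with a short sketch. It assumes the approximate RG map $K' = \tfrac{3}{8}\ln\cosh 4K$, locates its nontrivial fixed point by bisection (the bracket $[0.1, 2]$ and the tolerance are arbitrary choices made here), evaluates the slope $dK'/dK = \tfrac{3}{2}\tanh 4K$ there, and follows the flow on either side of $K_c$.

```python
import math

def rg_map_2d(K):
    """Approximate 2D RG map (A): K' = (3/8) ln cosh 4K."""
    return 0.375 * math.log(math.cosh(4.0 * K))

def critical_coupling(lo=0.1, hi=2.0, tol=1e-12):
    """Bisection for the nontrivial root of R(K) - K = 0 inside [lo, hi]."""
    def f(K):
        return rg_map_2d(K) - K
    assert f(lo) < 0.0 < f(hi)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def flow(K, steps=20, K_max=10.0):
    """Iterate the map, stopping early once K has clearly run off to strong coupling."""
    for _ in range(steps):
        if K > K_max:
            break
        K = rg_map_2d(K)
    return K

K_c = critical_coupling()
slope = 1.5 * math.tanh(4.0 * K_c)              # dK'/dK evaluated at K_c
alpha = 2.0 - math.log(2.0) / math.log(slope)   # formula (6-23)
print("K_c   =", K_c)
print("slope =", slope)
print("alpha =", alpha)
print("flow from 0.45:", flow(0.45), "  flow from 0.60:", flow(0.60))
```

Since the slope at $K_c$ exceeds 1, the fixed point is unstable, and starting values just below and just above $K_c$ run off to weak and strong coupling respectively.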

6.3  RG  Applied  to  Period-Doubling 

We treat the 1D quadratic map with parameter $A$,

$$F_A(x) = x^2 + Ax, \quad -2 < A < 1, \tag{6-29}$$

which maps the interval $[-1, 1 - A]$ into itself. This map is topologically equivalent to the logistic map (3-9) by a linear change of variable. There are two fixed points $x^* = 0, 1 - A$. In $-1 < A < 1$ the fixed point $x^* = 0$ is stable (remember, “stable” will mean asymptotically stable in this section), while $x^* = 1 - A$ is unstable. At $A = -1$ the fixed point $x^* = 0$ also goes unstable, and the stable fixed points

$$x^*_\pm = -\tfrac{1}{2}(A + 1) \pm \tfrac{1}{2}\sqrt{(A + 1)(A - 3)} \tag{6-30}$$

of $F^2$ are born. These form a stable 2-cycle of $F$. Consider the orbit based at $x^*_+ + u$, $u$ small. Then

$$x_1 = F(x^*_+ + u) = x^*_- + \Delta x_1, \quad \Delta x_1 = (A + 2x^*_+) u + u^2, \tag{6-31}$$

where we simply expanded (6-29) out and used $F(x^*_+) = x^*_-$. Iterating again, we get obviously

$$x_2 = F(x^*_- + \Delta x_1) = x^*_+ + \Delta x_2, \quad \Delta x_2 = (A + 2x^*_-) \Delta x_1 + (\Delta x_1)^2, \tag{6-32}$$

where we used (6-31) and $F(x^*_-) = x^*_+$. Now substitute $\Delta x_1$ from (6-31) into $\Delta x_2$ from (6-32), and keep only terms up to $O(u^2)$. By expanding out, this becomes

$$\Delta x_2 \approx (A + 2x^*_+)(A + 2x^*_-) u + [A + 2x^*_- + (A + 2x^*_+)^2] u^2. \tag{6-33}$$

This looks something like the original map (6-29); by rescaling

$$x' = \alpha u, \quad x'_2 = \alpha \Delta x_2, \quad \alpha = A + 2x^*_- + (A + 2x^*_+)^2, \tag{6-34}$$

(6-33) takes the form $x'_2 = F'_{A'}(x')$, where

$$F'_{A'}(x') = x'^2 + A' x', \quad A' = (A + 2x^*_+)(A + 2x^*_-) = -A^2 + 2A + 4. \tag{6-35}$$

We used $x^*_\pm$ from (6-30). But $F'_{A'}$ is the same function which occurred in the original map (6-29) for a transformed parameter $A'$,

$$A' = R(A) = -A^2 + 2A + 4. \tag{6-36}$$

Thereby  we  have  achieved  an  RG  transformation  for  period-doubling  in  quadratic  maps. 
This  guarantees  the  self-similarity  noted  in  section  3.10.2,  the  period-doubling  cascade. 

An important fine point: the map (6-35) is not the same as the map (6-29) since different regions are mapped, cf. (6-34) and $u = x - x^*_+$. That is, $F'$ is not the same map as $F$; rather, they are topologically equivalent!


Now we can draw various conclusions. The fixed points $x^*_\pm$ of $F^2$ correspond to the fixed point 0 of $F'_{A'}$. Thus $x^*_\pm$ are stable in the range $-1 < A' < 1$, or $1 - \sqrt{6} < A < -1$ by (6-36). At $A = 1 - \sqrt{6} \approx -1.4495$ they lose stability, and two stable fixed points of $F'^2$ are born, that is, a stable 4-cycle of $F$. Thus the RG method can approximate the period-doubling bifurcation values which we called $\mu_n > 3$ for the logistic map in section 3.10.


We can continue this way, defining period-doubling bifurcation values $A_n$ with $A_1 = 1 - \sqrt{6}$. In the limit $n \to \infty$, $A_n \to$ a limit $A_\infty$. Solving $A_\infty = -A_\infty^2 + 2A_\infty + 4$ from (6-36), we get

$$A_\infty = \tfrac{1}{2}(1 - \sqrt{17}) \approx -1.5616. \tag{6-37}$$


Prob. 6.3 Find the linear transformation connecting map (6-29) with the logistic map. Thus translate the values $A_1$ and $A_\infty$ found here into values of $\mu$ and compare with $\mu_1$ and $\mu_\infty$ found in section 3.10.

If we assume that the convergence of $A_n$ is asymptotically geometric, $A_n = A_\infty + C\delta^{-n}$ ($\delta > 1$), we get the same $\delta$ found before, (3-10). This is no accident; one can prove easily that if we have a new parameter $A = g(\mu)$, $g$ differentiable, then $\delta_g$ defined by (3-10) with $\mu \to g(\mu)$ is exactly the same: $\delta_g = \delta$. Further, the scaling parameter $\alpha$ defined in (6-34) should also be asymptotically universal (check by putting $x^*_\pm$ from (6-30) and $A = A_\infty$ into this formula). The “exact” value found for the logistic map is $\alpha \approx -2.5029078\cdots$.
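The check suggested above is easy to carry out numerically. The sketch below assumes the parameter map $R(A) = -A^2 + 2A + 4$, pulls the first bifurcation value $A = -1$ back through the branch $A = 1 - \sqrt{5 - A'}$ of the inverse to generate successive approximate bifurcation values, and evaluates the rescaling factor $\alpha = A + 2x^*_- + (A + 2x^*_+)^2$ at $A_\infty$. The list indexing (starting with $A = -1$ as entry 0) and the function names are choices made here.

```python
import math

def R(A):
    """RG parameter map (6-36)."""
    return -A * A + 2.0 * A + 4.0

def R_inverse(A_prime):
    """Branch of the inverse of R with A < 1: A = 1 - sqrt(5 - A')."""
    return 1.0 - math.sqrt(5.0 - A_prime)

def bifurcation_values(n):
    """Start from A = -1 (first period doubling) and pull back n times with R_inverse."""
    values = [-1.0]
    for _ in range(n):
        values.append(R_inverse(values[-1]))
    return values

def alpha(A):
    """Rescaling factor (6-34), using the 2-cycle points (6-30); requires A < -1."""
    root = math.sqrt((A + 1.0) * (A - 3.0))
    x_plus = -0.5 * (A + 1.0) + 0.5 * root
    x_minus = -0.5 * (A + 1.0) - 0.5 * root
    return A + 2.0 * x_minus + (A + 2.0 * x_plus) ** 2

A = bifurcation_values(12)
A_inf = 0.5 * (1.0 - math.sqrt(17.0))
ratios = [(A[i] - A[i - 1]) / (A[i + 1] - A[i]) for i in range(1, len(A) - 1)]
print("A values     :", [round(a, 6) for a in A[:5]], "...")
print("A_inf        :", A_inf)
print("delta (RG)   :", ratios[-1])
print("alpha (RG)   :", alpha(A_inf))
```

Within this lowest-order scheme the ratio of successive differences tends to $R'(A_\infty) = 1 + \sqrt{17} \approx 5.12$ and the rescaling factor to about $-2.24$, to be compared with the accepted values $\delta \approx 4.669$ and $\alpha \approx -2.5029$; the discrepancy reflects the terms beyond $O(u^2)$ that were dropped.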

6.4  RG  Applied  to  the  Tangent  Bifurcation 

Tangent  bifurcations  were  described  in  section  3.10.3.  We  want  now  to  calculate  the  time 
of  transit,  the  time  that  the  orbit  point  stays  in  the  “neck”  in  Fig.  3.9,  by  RG  methods. 

Iterate the map $G_\mu$ and, in keeping with the spirit of the model, keep only terms of $O(x^2)$ and $O(\mu)$:

$$G_\mu^2(x) = \mu + (\mu + x - x^2) - (\mu + x - x^2)^2 \approx 2\mu + x - 2x^2. \tag{6-38}$$

But

$$\tfrac{1}{2} G_{4\mu}(2x) = 2\mu + x - 2x^2,$$

the same as (6-38), so

$$G_\mu^2(x) \approx \tfrac{1}{2} G_{4\mu}(2x). \tag{6-39}$$

This is exactly the statement that the map $G_\mu^2$ is topologically equivalent to the map $G_{4\mu}$ in the region $X$: $x$ and $\mu$ small, via the homeomorphism $h(x) = 2x$, cf. (3-7).

Hence if we define: $N(\mu)$ steps are necessary for the orbit point of $G_\mu$ to pass through the neck, we have

$$N(\mu)/2 = N(4\mu), \tag{6-40}$$

since the transit time is only half as long for $G_\mu^2$ as for $G_\mu$. This functional equation has the solution

$$N(\mu) = C/\sqrt{-\mu}, \quad C = \text{const.} \ (\mu < 0), \tag{6-41}$$

which is the desired answer.
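The scaling law for the transit time can be tested by simply counting iterations. The sketch below assumes the local map $G_\mu(x) = \mu + x - x^2$ with $\mu < 0$ and defines the "neck" rather arbitrarily as the interval between $x = 0.3$ and $x = -0.3$; these endpoints and the function name are illustrative choices, and only the dependence on $\mu$ matters.

```python
def transit_steps(mu, x_start=0.3, x_end=-0.3):
    """Count iterations of G_mu(x) = mu + x - x^2 (mu < 0) needed to drift from x_start past x_end."""
    x = x_start
    steps = 0
    while x > x_end:
        x = mu + x - x * x
        steps += 1
    return steps

for mu in (-1e-6, -4e-6, -1.6e-5):
    n = transit_steps(mu)
    print(f"mu = {mu:.1e}   N = {n}   N * sqrt(-mu) = {n * (-mu) ** 0.5:.4f}")
```

Quadrupling $|\mu|$ roughly halves the count, and $N\sqrt{-\mu}$ settles near a constant, in line with (6-40) and (6-41).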


6.5  RG  Applied  to  Percolation 

Percolation theory has evolved considerably since its humble beginnings as the study of the percolation of fluids through porous solids. For some basics, see Plischke and Bergersen (1989, pp. 317, 318). The notions we need are the following: there is a lattice, say 2D, with site occupation probability $p$. At $p = p_c$, the critical probability, the infinite cluster just forms. At this point the configuration, the lattice with occupied and unoccupied sites marked • and o respectively, should be self-similar, that is, be invariant under an arbitrary change of scale $x \mapsto ax$ (see section 6.1).

We take a triangular lattice with some occupation probability $p$ and coarse-grain it by dividing it into blocks of three sites. These block sites will be said to be occupied or not by the majority rule: occupied if the number of •’s is three or two, unoccupied if this number is one or zero. See Fig. 6.5.


•  =  occupied 
o  =  unoccupied 


Fig.  6.5  Coarse-grained  triangular  percolation  lattice 

The block sites are put at the centers of the original equilateral triangles. Therefore the scale (def. lattice spacing) has been increased in the ratio $1 : \sqrt{3}$ (see Fig. 6.5).

If $p'$ is the block site occupation probability, we have

$$p' = R_{\sqrt{3}}(p) = p^3 + 3p^2(1 - p). \tag{6-42}$$

Ex. 6.1 Verify (6-42). This is a simple exercise in using the laws of probability.

The RG map (6-42) has fixed points $p^* = 0$, 1, and 1/2. The first two are trivial fixed points. We identify $p_c$ with the third, $p_c = 1/2$, for the reasons given just above. The “flow” under iteration of the RG map (6-42) could be investigated as before, but we are more concerned here with calculating a critical exponent.

We assume that the correlation length $\xi(p)$ has behavior

$$\xi(p) \approx |p - p_c|^{-\nu} \tag{6-43}$$

with critical exponent $\nu$ near the critical point. The aim is to calculate $\nu$ using the RG method. Now we have increased the scale by a factor $a > 1$ by the coarse-graining operation, so have decreased the correlation length by the same factor, $\xi'(p') = \xi(p)/a$. See Bruce and Wallace (1989) for a clear explanation of this. But since the system is invariant at $p = p_c$, the same law (6-43) holds for the rescaled lattice,

$$\xi'(p') \approx |p' - p_c|^{-\nu}. \tag{6-44}$$

Hence $|p - p_c|^{-\nu}/a = |p' - p_c|^{-\nu}$, or

$$p' - p_c = a^{1/\nu}(p - p_c). \tag{6-45}$$

Now write

$$p' \approx p_c + (dp'/dp)\big|_{p_c} (p - p_c) = 1/2 + (3/2)(p - 1/2).$$

Compare this with (6-45); we get $a^{1/\nu} = (\sqrt{3})^{1/\nu} = 3/2$, since $a = \sqrt{3}$ goes with the RG map (6-42). Solve:

$$\nu = \ln \sqrt{3} / \ln(3/2) \approx 1.35. \tag{6-46}$$

More rigorous methods give $\nu = 4/3$, so the RG result is not bad.
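The whole percolation calculation fits in a few lines, which makes it a convenient place to see the RG recipe end to end: a map, its nontrivial fixed point, the slope there, and the exponent. The sketch below assumes the majority-rule map $p' = p^3 + 3p^2(1 - p)$ with scale factor $a = \sqrt{3}$; the function names and the starting values 0.45 and 0.55 for the flow are illustrative.

```python
import math

def block_probability(p):
    """RG map (6-42): a block of three sites is occupied if two or three of its sites are."""
    return p ** 3 + 3.0 * p ** 2 * (1.0 - p)

def slope(p):
    """Derivative of the RG map: dp'/dp = 6 p (1 - p)."""
    return 6.0 * p * (1.0 - p)

def flow(p, steps=30):
    """Iterate the coarse-graining map a fixed number of times."""
    for _ in range(steps):
        p = block_probability(p)
    return p

p_c = 0.5
a = math.sqrt(3.0)                              # length rescaling of one coarse-graining step
nu = math.log(a) / math.log(slope(p_c))         # formula (6-46)
print("R(p_c)       =", block_probability(p_c))
print("dp'/dp at p_c =", slope(p_c))
print("nu           =", nu)
print("flow from 0.45:", flow(0.45), "  flow from 0.55:", flow(0.55))
```

Starting slightly below $p_c$ the iteration runs to the empty lattice $p^* = 0$, and slightly above it to the full lattice $p^* = 1$, so $p_c = 1/2$ is the unstable fixed point separating the two regimes.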
7. PARTIAL DIFFERENTIAL EQUATIONS
7.1 General Remarks


Pde dynamics is the most difficult we have yet met in this survey. One must solve differential equations in several independent variables $x \in \mathbb{R}^n$ and in time $t$ for fields $\Phi(x, t)$ defined at all points of some region $U$ of $\mathbb{R}^n$ and some range of times. Thus there are an infinite number of degrees of freedom as opposed to the finite-dimensional phase spaces of flows, sets of $N$ ode’s. (“Pde” and “ode” for partial and ordinary differential equations, respectively, are abbreviations common in the literature, and we shall use them.) Add to this the fact that there are different species of pde’s (elliptic, hyperbolic, etc.), each with its own kind of initial and boundary conditions in order to be well-posed, while there is only one kind of ode. If the computer is turned loose on a badly posed pde, it prints out reams of nonsense.

Different ways of circumventing these difficulties in pde dynamics have been devised, since the brute force method of simply integrating them from initial and boundary conditions as specified in existence theorems is usually impracticable, even for the largest computers. Solving pde’s in physics is as much an art as a science; it takes experience and good intuition to capture the essential features of a rigorous solution while neglecting a vast number of inessential details. We will not discuss all of these ways here but focus on one method only, the Galerkin method, which is particularly well-suited to revealing those features of a pde dynamics in which nonlinear dynamics (“chaos theory”) is interested. It in fact converts the pde dynamics into a finite-dimensional flow, about which we know quite a lot (Chap. 2).

7.2  Convection  Equations 


A  typical  phenomenon  of  interest  is  convection  in  an  incompressible,  viscous  fluid.  The 
basic  equations  are  the  Navier-Stokes  equations  of  hydrodynamics  and  an  energy,  or  heat, 
equation.  In  the  Boussinesq  approximation  and  in  nondimensionalized  form  the  equations 
read 

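$\partial\mathbf{v}/\partial t + \frac{1}{P}(\mathbf{v}\cdot\nabla)\mathbf{v} = -\nabla p + \Delta\mathbf{v} + R\,T\,\mathbf{n}$ ,  (7-1a)

$P\,\partial T/\partial t + \mathbf{v}\cdot\nabla T = \Delta T + \mathbf{v}\cdot\mathbf{n}$ ,  (7-1b)

$\nabla\cdot\mathbf{v} = 0$ ,  (7-1c)

Boundary conditions. For example, with solid walls:

$\mathbf{v} = 0$ , $T = 0$ , or a condition on $\partial T/\partial n$ (on the walls).  (7-1d)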

Here $\mathbf{v}$ = fluid velocity, $T$ = absolute temperature, $p$ = pressure, and $\Delta = \nabla^2$ is the Laplacian.
The stationary or quiescent solution of the original convection equations is simply

$\mathbf{v}_0 = 0$ , $\nabla T_0 = -A\,\mathbf{n}$ , $A > 0$ const. ,  (7-2)

which describes a zero velocity field and a uniform vertical temperature gradient. $\mathbf{n}$ is
a unit vector in the direction of the uniform gravitational field $\mathbf{g}$. The inhomogeneous
boundary conditions of those original equations for $\mathbf{v}_{tot}$ and $T_{tot}$ specify velocity or its
derivatives and temperature or its derivatives on the boundaries in physical 3-space $\mathbb{R}^3$.
Having found the quiescent solution (7-2), one writes pde’s for the excesses $\mathbf{v}$ and $T$,
defined by

$\mathbf{v}_{tot} = \mathbf{v}_0 + \mathbf{v}$ ($= \mathbf{v}$) , $T_{tot} = T_0 + T$ .

Since $T_0$ and $\mathbf{v}_0$ satisfy the same boundary conditions as $T_{tot}$ and $\mathbf{v}_{tot}$, the excesses
satisfy homogeneous b.c.’s as in (7-1d), very important for the applicability of the Galerkin
method. Let “b.c.’s” be short for “boundary conditions” hereafter; it will save a lot of
space. Homogeneous b.c.’s mean linear b.c.’s: if fields $\phi_1$ and $\phi_2$ satisfy the b.c.’s, then so
does any linear combination $a\phi_1 + b\phi_2$. The excesses and the coordinates are then made
dimensionless by dividing them by available dimensional constants (beware, this can be
done in several ways!) so that the only residue of the dimensional physical constants of the
original convection equations left in the equations (7-1) are a few dimensionless constants,

$R = g\beta A L^4/(\nu\chi)$ = Rayleigh number , $P = \nu/\chi$ = Prandtl number.  (7-3)

The reader can refer to any number of good books or articles for this Boussinesq reduction
and for the definition of the various physical constants occurring. We follow here Gershuni
and Zhukovitskii (1976), hereafter GZ. We will only name them here: $\beta$ = coefficient of
thermal expansion, $\chi$ = thermal diffusivity, $\nu$ = coefficient of kinematic viscosity, $L$ =
characteristic length; $\mathbf{n}$, $A$, and $g$ have already been defined.
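
To make the two dimensionless constants concrete, the following sketch evaluates them, taking (7-3) to read R = g β A L^4/(ν χ) and P = ν/χ. All numerical values are hypothetical, loosely water-like, and are not taken from the text or from GZ; the function names are likewise illustrative.

```python
def rayleigh_number(g, beta, A, L, nu, chi):
    # R = g * beta * A * L**4 / (nu * chi), with A the temperature gradient.
    return g * beta * A * L**4 / (nu * chi)

def prandtl_number(nu, chi):
    # P = nu / chi
    return nu / chi

# HYPOTHETICAL values in SI units (not from the text).
g, beta = 9.81, 2.0e-4        # gravity (m/s^2), thermal expansion (1/K)
nu, chi = 1.0e-6, 1.4e-7      # kinematic viscosity, thermal diffusivity (m^2/s)
L, delta_T = 0.01, 1.0        # layer height (m), temperature difference (K)
A = delta_T / L               # uniform gradient, so that delta_T = A * L

R = rayleigh_number(g, beta, A, L, nu, chi)
P = prandtl_number(nu, chi)
print(f"R = {R:.3e}, P = {P:.2f}")
```

Because the temperature difference across the layer is A L, the same number is often written with the difference and L^3 in place of A and L^4.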

7.3  The  Rayleigh  Problem 

If we assume 2D motion in a plane horizontal layer with $z$ pointing up (no $y$-dependence
and $v_y = 0$), equations (7-1) simplify, for then a stream function $\psi(x,z)$ exists:

$v_z = \partial\psi/\partial x = \psi_x$ , $v_x = -\partial\psi/\partial z = -\psi_z$ ,  (7-4)

which satisfies $\nabla\cdot\mathbf{v} = 0$ identically. (From now on we often use a subscript $x$ or $z$ to mean
$\partial/\partial x$ or $\partial/\partial z$.) Now if we take the $zx$ curl of (7-1a), the $\nabla p$ term is eliminated. Note that

$\partial v_x/\partial z - \partial v_z/\partial x = -\psi_{zz} - \psi_{xx} = -\Delta_2\psi$ ,

where $\Delta_2$ is the 2D Laplacian. The nonlinear terms in (7-1a) and (7-1b) become Jacobian
determinants, defined for functions $f$ and $g$ of $x$ and $z$ as

$\partial(f,g)/\partial(z,x) = (\partial f/\partial z)(\partial g/\partial x) - (\partial f/\partial x)(\partial g/\partial z)$ .

We then get, where $\dot f = \partial f/\partial t$,

$\Delta_2\dot\psi = \Delta_2^2\psi + R\,T_x + \frac{1}{P}\,\partial(\psi,\Delta_2\psi)/\partial(z,x)$ ,  (7-5a)

$P\,\dot T = \Delta_2 T + \psi_x + \partial(\psi,T)/\partial(z,x)$ ,  (7-5b)

b.c.’s : $\psi = \psi_{zz} = T = 0$ at $z = 0, 1$ (all $x$, all $t$).  (7-5c)

Here the homogeneous b.c.’s (7-5c) came from the Rayleigh problem b.c.’s: the upper and
lower plane boundaries of the layer are free surfaces (no tangential fluid stresses), that is,

$v_z = 0$ , $\partial v_x/\partial z = \partial v_y/\partial z = 0$

on those surfaces. Also $L$ was chosen to be the height of the layer, so that dimensionless
$z = 0, 1$ defines those surfaces.
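
The two structural facts used in this section, that the stream function makes the velocity divergence-free and that the advective term -(v·∇)T equals the Jacobian ∂(ψ,T)/∂(z,x), can be checked with finite differences. This is a minimal sketch: the test functions `psi` and `temp`, the wave number `K`, and the sample point are arbitrary choices made for illustration.

```python
import math

K = 2.0  # illustrative horizontal wave number (assumption)

def psi(x, z):
    # Any smooth stream function will do; this one vanishes at z = 0 and z = 1.
    return (math.sin(math.pi * z) * math.sin(K * x)
            + 0.3 * math.sin(2 * math.pi * z) * math.cos(K * x))

def temp(x, z):
    return math.sin(math.pi * z) * math.cos(K * x) + 0.5 * math.sin(2 * math.pi * z)

def d_dx(f, h=1e-3):
    # Central difference in x, returned as a new function of (x, z).
    return lambda x, z: (f(x + h, z) - f(x - h, z)) / (2 * h)

def d_dz(f, h=1e-3):
    # Central difference in z.
    return lambda x, z: (f(x, z + h) - f(x, z - h)) / (2 * h)

v_z = d_dx(psi)                                # v_z =  psi_x
v_x = lambda x, z: -d_dz(psi)(x, z)            # v_x = -psi_z

def divergence(x, z):
    return d_dx(v_x)(x, z) + d_dz(v_z)(x, z)

def advection(x, z):
    # -(v . grad) T
    return -(v_x(x, z) * d_dx(temp)(x, z) + v_z(x, z) * d_dz(temp)(x, z))

def jacobian(f, g):
    # d(f, g)/d(z, x) = f_z g_x - f_x g_z
    return lambda x, z: d_dz(f)(x, z) * d_dx(g)(x, z) - d_dx(f)(x, z) * d_dz(g)(x, z)

x0, z0 = 0.37, 0.61
div_value = divergence(x0, z0)
mismatch = advection(x0, z0) - jacobian(psi, temp)(x0, z0)
print(div_value, mismatch)
```

Both printed numbers are zero to rounding error. They are algebraic identities of the stream-function representation and do not depend on the particular test functions.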

7.4  The  Galerkin  Method 

Equations  (7-5)  are  close  to  a  standard  form  for  nonlinear  pde’s,  which  we  take  as 

$\dot\Phi = A\Phi + F(\Phi)$ , $\mathcal{L}(\Phi) = 0$.  (7-6)

Here $\Phi(x,t)$, $x \in \mathbb{R}^n$, is the field or set of $p$ fields in question, thus $\Phi \in \mathbb{R}^p$. $A$ is a
linear operator and $F$ a nonlinear operator on $\Phi$, usually involving partial derivatives with
respect to $x$. $\mathcal{L}(\Phi) = 0$ is the set of homogeneous boundary conditions and possibly linear
pde’s not involving $\partial/\partial t$, like (7-1c) for example. We shall call $\mathcal{L}(\Phi) = 0$ “the b.c.’s” for
short. Now an important concept:

Def. Let $\mathcal{H}$ be the linear (or vector) space of all $p$-component spatial
functions $\phi(x) \in \mathbb{R}^p$, $x \in \mathbb{R}^n$, which satisfy $\mathcal{L}(\phi) = 0$.  (7-7)

(Remember how general a concept “vector space” is: any set of objects closed under
taking linear combinations!) Any such $\phi(x)$ is called a Galerkin vector. For example, in
the Rayleigh system (7-5) any Galerkin vector has the form

$\phi(x,z) = \begin{pmatrix} \psi \\ T \end{pmatrix} \in \mathbb{R}^2$ , where $\psi$ and $T$ satisfy (7-5c).  (7-8)

We assume that $\mathcal{H}$ is invariant under $A$ and $F$: $A : \mathcal{H} \to \mathcal{H}$, $F : \mathcal{H} \to \mathcal{H}$. The last
requirement is highly nontrivial.

But we can immediately write down a complete set or basis $\{\phi_{nk}\}$ of Galerkin modes
satisfying (7-8):

$\psi_{nk} = \sin n\pi z\,\sin kx$ , $T_{nk} = \sin n\pi z\,\cos kx$ ,

$n = 1, 2, 3, \cdots$ , $0 \le k < \infty$ .  (7-9)

The basis $\{\phi_{nk}\}$ spans the infinite-dimensional vector space $\mathcal{H}$. (One advantage of the
Rayleigh b.c.’s (7-5c) is that the Galerkin modes (7-9) take such a simple form. For
one free surface and one solid wall or for two solid walls, or for more generally shaped
boundaries and any b.c.’s, one cannot in general express the Galerkin basis in terms of
finite expressions in elementary functions.)
For the general system (7-6) let $\{\phi_\alpha(x) \mid \text{all } \alpha\}$ be a Galerkin basis for some infinite index
set of labels $\alpha$. Then any solution of (7-6) has the form

$\Phi(x,t) = \sum_\alpha c_\alpha(t)\,\phi_\alpha(x)$  (7-10)

for some functions $c_\alpha(t)$ of time only. Here and below $\sum_\alpha$ may have to be interpreted
as an integral $\int d\alpha$ if the labels $\alpha$ are continuous. $\mathcal{H}$ invariant under $F$, in other words,
products of the $\phi_\alpha$ expressible as linear combinations of all the $\phi_\alpha$, was necessary for the
truth of (7-10). Write this formally as

$F\left(\sum_\alpha c_\alpha\phi_\alpha\right) = \sum_\alpha f_\alpha(c)\,\phi_\alpha$ for all $c \equiv \{c_\alpha \mid \text{all } \alpha\}$.  (7-11)

The Galerkin modes form an algebra in this sense. For the linear operator $A$ we write

$A\sum_\beta c_\beta\phi_\beta = \sum_{\alpha,\beta} (a_{\alpha\beta}c_\beta)\,\phi_\alpha$ ,  (7-12)

which introduces the matrix elements $a_{\alpha\beta}$ of $A$. Then when we put (7-10), (7-11), and
(7-12) into (7-6) and equate coefficients of the linearly independent $\phi_\alpha$ to zero, we get the
equations of the infinite-dimensional nonlinear flow

$\dot c_\alpha = \sum_\beta a_{\alpha\beta}c_\beta + f_\alpha(c)$ , all $\alpha$ ,  (7-13)

that is, an infinite set of coupled nonlinear first order ode’s in time for the $c_\alpha(t)$! Eq.
(7-13) is exact, and equivalent to the pde system (7-6) in principle.

The idea of the Galerkin method is to approximate the solution (7-10) by using a finite
set of the most important modes, say $\phi_1, \phi_2, \cdots, \phi_M$ after renumbering the labels $\alpha$. One
writes $\Phi(x,t) \approx \sum_{i=1}^{M} c_i(t)\,\phi_i(x)$ and gets an approximate closed set of $M$ ode’s by keeping
only $M$ of the equations (7-13) and dropping all the $c_i$, $i > M$, in those equations:

$\dot c_i = \sum_{j=1}^{M} a_{ij}c_j + f_i(c')$ ,

$c' \equiv \{c_1, c_2, \cdots, c_M\}$ , $i = 1, 2, 3, \cdots, M$.  (7-14)

Choosing the “most important modes” is of course an art. See further remarks below.
Technically, one truncates the infinite set (7-13) by projecting it onto the finite-dimensional
subspace $\mathcal{H}_M \subset \mathcal{H}$ spanned by $\phi_1, \phi_2, \cdots, \phi_M$. In particular, $f_i(c')$ comes from $f_i(c)$ by
setting to zero all $c_i$, $i > M$. This prescription is all right if the $f_\alpha(c)$ admit power series
expansions in the $c_\beta$.
One method of choosing a Galerkin basis of $\mathcal{H}$ has some advantages. We use the normal
modes $g_i(x)$ of the linearized problem $\dot\Phi = A\Phi$, with the same b.c.’s $\mathcal{L}(\Phi) = 0$. Look for
solutions $\Phi(x,t) = e^{\lambda t}g(x)$. Then, cancelling $e^{\lambda t}$, we get

$\lambda g = Ag$ , $\mathcal{L}(g) = 0$ .  (7-15)

Let $g_i(x)$, $\lambda_i$, $i = 1, 2, 3, \cdots$ be the set of eigenfunctions and eigenvalues of $A$. Usually
the set $\{g_i\}$ spans $\mathcal{H}$; if generalized eigenfunctions of $A$ exist (see appendix A), they must
be adjoined to $\{g_i\}$ in order to span $\mathcal{H}$. Assuming that the proper eigenfunctions do span
$\mathcal{H}$, we can use them instead of the general Galerkin basis used above. The flows (7-13)
and (7-14) simplify because the matrix $(a_{ij})$ is diagonal. The approximate flow then reads

$\dot c_i = \lambda_i c_i + f_i(c')$ , $i = 1, 2, \cdots, M$.  (7-16)


7.5  Rayleigh  Problem  by  the  Galerkin  Method 


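Let us come back to the specific system (7-5). We can put it into standard form (7-6) by
dividing (7-5b) by the number $P$ and “dividing” (7-5a) by the operator $\Delta_2$. The latter is
all right if $\Delta_2$ has an inverse $\Delta_2^{-1}$ on the space spanned by (7-9); and this is in fact so,
since

$\Delta_2\phi_{nk} = -(n^2\pi^2 + k^2)\,\phi_{nk} = -\gamma_{nk}\,\phi_{nk}$ ,  (7-17)

and $\gamma_{nk}$ is always positive. So we rewrite

$\dot\psi = \Delta_2\psi + R\,\Delta_2^{-1}T_x + \frac{1}{P}\,\Delta_2^{-1}\,\partial(\psi,\Delta_2\psi)/\partial(z,x)$ ,  (7-18a)

$\dot T = \frac{1}{P}\,\Delta_2 T + \frac{1}{P}\,\psi_x + \frac{1}{P}\,\partial(\psi,T)/\partial(z,x)$ ,  (7-18b)

$\psi = \psi_{zz} = T = 0$ at $z = 0, 1$ (all $x$ and all $t$).  (7-18c)

The normal modes are (GZ 1976)

$g^\pm_{nk} = \begin{pmatrix} \psi^\pm_{nk} \\ T^\pm_{nk} \end{pmatrix} = \begin{pmatrix} k^{-1}a^\pm_{nk}\,\sin n\pi z\,\sin kx \\ b^\pm_{nk}\,\sin n\pi z\,\cos kx \end{pmatrix}$ ,  (7-19a)

$\lambda^\pm_{nk} = -\frac{P+1}{2P}\,\gamma_{nk} \pm \left[\left(\frac{(P-1)\gamma_{nk}}{2P}\right)^2 + \frac{Rk^2}{P\gamma_{nk}}\right]^{1/2}$ ,  (7-19b)

$a^\pm_{nk} = (P\lambda^\pm_{nk} + \gamma_{nk})\,b^\pm_{nk}$ , $\gamma_{nk} = n^2\pi^2 + k^2$ .  (7-19c)

(Please note: we write time-dependence $e^{\lambda t}$ of normal modes, GZ write $e^{-\lambda t}$. So our
eigenvalues (7-19b) are the negatives of GZ’s, and we have changed some other notation
accordingly.) The complete label $i$ is $(\pm, n, k)$ with $n = 1, 2, 3, \cdots$ and $0 \le k < \infty$. The
parameter $\mu$ is the pair $R$, $P$.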
Local bifurcation values are determined by $\lambda^\pm_{nk} = 0$ for each $n$ and $k$. All $\lambda^-_{nk} < 0$.
Then the mode $(+, n, k)$ changes from decaying ($\lambda < 0$) to growing ($\lambda > 0$) as $\lambda$ increases
through zero. (Note all $\lambda$’s are real.) See Fig. 7.1.
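
The sign change can be seen by evaluating the eigenvalue formula directly. The sketch below assumes that (7-19b) reads λ± = -(P+1)γ/(2P) ± [((P-1)γ/(2P))^2 + R k^2/(P γ)]^(1/2) with γ = n^2 π^2 + k^2; the function names are illustrative, and the values k = 2, P = 3 are those quoted for the top panel of Fig. 7.1.

```python
import math

def gamma_nk(n, k):
    # gamma_nk = n^2 pi^2 + k^2
    return (n * math.pi) ** 2 + k ** 2

def eigenvalues(R, P, n, k):
    # Growth rates (lambda_plus, lambda_minus) of the normal modes,
    # using the assumed form of (7-19b).
    g = gamma_nk(n, k)
    mean = -(P + 1.0) * g / (2.0 * P)
    root = math.sqrt(((P - 1.0) * g / (2.0 * P)) ** 2 + R * k ** 2 / (P * g))
    return mean + root, mean - root

k, P = 2.0, 3.0
for R in (500.0, 1000.0, 5000.0, 20000.0):
    rates = [eigenvalues(R, P, n, k)[0] for n in (1, 2, 3)]
    print(R, [round(lam, 2) for lam in rates])
```

Reading down the printed rows, the n = 1 rate changes sign first as R grows, while the n = 2 and n = 3 rates remain negative until much larger R.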


Fig. 7.1 Showing the eigenvalues $\lambda^\pm_{nk}$ for $k = 2$ and $n = 1, 2, 3$ as functions of $R$. Top:
$P = 3$, middle: $P = 1$, bottom: P =

These critical Rayleigh numbers come out to be

$R_{nk} = k^{-2}\gamma_{nk}^3$ ,  (7-20)

from (7-19b), independent of $P$. So $R_{nk}$ increases monotonically in $n$ for any $k$. The
smallest critical Rayleigh number is $R_1 = (27/4)\pi^4$ (at $n = 1$ and $k = \pi/\sqrt{2}$). $R_1$ is thus
the threshold for convection; for $R < R_1 \approx 657.5$ there is only the quiescent solution (7-2).
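
The location and value of the minimum can be confirmed with a crude scan over k. This is a sketch that takes (7-20) to read R_nk = γ_nk^3/k^2 with γ_nk = n^2 π^2 + k^2; the scan range and step are arbitrary choices.

```python
import math

def critical_rayleigh(n, k):
    # R_nk = gamma_nk**3 / k**2 with gamma_nk = n^2 pi^2 + k^2
    return ((n * math.pi) ** 2 + k ** 2) ** 3 / k ** 2

# Scan k from 0.5 to about 6.5 in steps of 0.001 for the n = 1 mode.
ks = [0.5 + 0.001 * j for j in range(6000)]
k_best = min(ks, key=lambda k: critical_rayleigh(1, k))
R_1 = critical_rayleigh(1, math.pi / math.sqrt(2.0))
print(f"k_best = {k_best:.3f}, pi/sqrt(2) = {math.pi / math.sqrt(2.0):.3f}, R_1 = {R_1:.1f}")
```

The scan settles on k close to π/√2, where the critical Rayleigh number agrees with (27/4)π^4.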

The wave number $k$ in the $x$-direction can be fixed by various experimental techniques, for
example, by the aspect ratio of the cell approximating the infinite plane horizontal layer
of the theory (see Chap. 8). We consider it fixed hereafter. Then if $R_{mk} < R < R_{m+1,k}$,
there will be a complicated convection with $m$ growing modes and the rest decaying modes.
Thus it would seem reasonable to try to approximate the nonlinear motion in this regime
by a Galerkin approximation including these $m$ modes plus possibly a few others. (The
flow is, after all, nonlinear, so we cannot rule out the influence of decaying modes, even as
$t \to \infty$.)

Let us fix $k$ and assume $R_{1k} < R < R_{2k}$. Then the unstable subspace $E^u$ will be 1D
only, spanned by $g_1 = g^+_{1k}$, since $\lambda^+_{1k} > 0$. All other modes are decaying, $\in E^s$. If we
want a physically good Galerkin approximation with $M > 1$ modes, how do we choose the
further modes? There is currently no clear, agreed-upon algorithm for this choice. The
pioneering work of Lorenz (1963) and succeeding work have provided some hints. For
example, $g_2 = g^+_{20}$ should certainly be included. We can understand this by noting that
the nonlinear terms in (7-18) generate this mode from $g^+_{1k}$ in the sense of (7-11). Namely,
if $F$ acts on the single mode $g^+_{1k}$, it produces the mode $g^+_{20}$. This idea can be made into
a precise algorithm for generating a sequence of improving Galerkin approximations (see
Ingraham, 1990).

So  now,  as  a  simple  model  Galerkin  approximation,  take 


$\Phi(x,t) = c_1(t)\,g_1(x) + c_2(t)\,g_2(x)$ ,  (7-21)

where $x \in \mathbb{R}^2$ stands for $(x,z)$. From (7-19) the normal modes are

$g_1 = \begin{pmatrix} k^{-1}a_1\,\sin\pi z\,\sin kx \\ \sin\pi z\,\cos kx \end{pmatrix}$ , $g_2 = \begin{pmatrix} 0 \\ \sin 2\pi z \end{pmatrix}$ ,  (7-22)

where we took $b^+_{1k} = b^+_{20} = 1$ and set $a_1 = a^+_{1k}$ determined by (7-19c), while $a_2 = a^+_{20}$
does not occur, see $g_2$. Put this into the pde’s (7-18) and truncate at the two modes kept,
as described above. The main work comes in computing the coefficients $f_i(c')$ in (7-16).
Because of the simple form of the normal modes for the Rayleigh problem, (7-11) is true,
and this algebra is in fact finite: $F$ acting on a finite sum $\sum c_\alpha\phi_\alpha$ produces only a finite
sum $\sum f_\alpha\phi_\alpha$. This is traced to the simple trigonometric identities

$\sin l\theta\,\sin m\theta = \frac{1}{2}[\cos(l - m)\theta - \cos(l + m)\theta]$ ,

$\cos l\theta\,\cos m\theta = \frac{1}{2}[\cos(l - m)\theta + \cos(l + m)\theta]$ ,  (7-23)

$\sin l\theta\,\cos m\theta = \frac{1}{2}[\sin(l - m)\theta + \sin(l + m)\theta]$ .

In our problem, with normal modes (7-22), we encounter only the cross products

$\sin\pi z\,\cos 2\pi z = \frac{1}{2}(\sin 3\pi z - \sin\pi z)$ ,

$\sin\pi z\,\cos\pi z = \frac{1}{2}\sin 2\pi z$ ,  (7-24)

as can be verified. Since we keep only the two modes $g_1$ and $g_2$, the $\sin 3\pi z$ will not
contribute at all. Eventually we arrive at the flow (7-16) for $M = 2$. By solving it
numerically we can get some idea of the actual flow near $R_{1k}$.
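
Re-expressing a product of modes as a sum of modes amounts to a projection onto the sine basis in z. The following sketch projects the cross product sin πz cos 2πz onto sin nπz on the interval 0 to 1 with the midpoint rule. The helper name `sine_coefficient` and the number of sample points are illustrative choices.

```python
import math

def sine_coefficient(f, n, samples=2000):
    # c_n = 2 * integral_0^1 f(z) sin(n pi z) dz, by the midpoint rule.
    h = 1.0 / samples
    total = 0.0
    for j in range(samples):
        z = (j + 0.5) * h
        total += f(z) * math.sin(n * math.pi * z)
    return 2.0 * h * total

def product(z):
    # Cross product of the z-dependences of the two modes kept.
    return math.sin(math.pi * z) * math.cos(2.0 * math.pi * z)

coeffs = {n: sine_coefficient(product, n) for n in range(1, 6)}
for n, c in coeffs.items():
    print(n, round(c, 6))
```

Only the n = 1 and n = 3 coefficients are nonzero, with magnitude one half and opposite signs. In a two-mode truncation the n = 3 part is simply discarded.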

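Prob. 7.1 Carry through the Galerkin approximation (7-21). For the flow (7-16)
you should find

$\dot c_1 = \lambda_1 c_1 + \frac{1}{P}\,\frac{\pi a_1 a_3}{a_3 - a_1}\,c_1 c_2$ , $\dot c_2 = \lambda_2 c_2 - \frac{\pi a_1}{2P}\,c_1^2$ ,  (7-25)

where $a_1 = a^+_{1k}$, $a_3 = a^-_{1k}$ determined by (7-19c) with $b^+_{1k} = b^-_{1k} = 1$, and $\lambda_1 = \lambda^+_{1k} > 0$,
$\lambda_2 = \lambda^+_{20} < 0$.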
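
Once the coefficients are known, the two-mode flow can be integrated numerically. The sketch below is a hedged illustration rather than a worked solution of the problem. It assumes that (7-25) reads ċ1 = λ1 c1 + (1/P)[π a1 a3/(a3 - a1)] c1 c2 and ċ2 = λ2 c2 - (π a1/(2P)) c1^2, that (7-19b) and (7-19c) give λ± and a± = Pλ± + γ (with b = 1), and that λ2 = -4π^2/P is the decay rate of the pure temperature mode sin 2πz. The values R = 700, P = 1, the step size, and the initial condition are arbitrary choices; a classical fourth-order Runge-Kutta step is used.

```python
import math

def two_mode_parameters(R, P, k):
    # Eigenvalues and amplitude ratios for n = 1 (assumed forms of 7-19b, 7-19c).
    gamma = math.pi ** 2 + k ** 2
    mean = -(P + 1.0) * gamma / (2.0 * P)
    root = math.sqrt(((P - 1.0) * gamma / (2.0 * P)) ** 2 + R * k ** 2 / (P * gamma))
    lam_plus, lam_minus = mean + root, mean - root
    a1 = P * lam_plus + gamma
    a3 = P * lam_minus + gamma
    lam2 = -4.0 * math.pi ** 2 / P          # decay rate of the mode sin(2 pi z)
    coupling = math.pi * a1 * a3 / (P * (a3 - a1))
    damping = math.pi * a1 / (2.0 * P)
    return lam_plus, lam2, coupling, damping

def rhs(c, lam1, lam2, coupling, damping):
    # Right-hand side of the assumed two-mode flow (7-25).
    c1, c2 = c
    return (lam1 * c1 + coupling * c1 * c2, lam2 * c2 - damping * c1 ** 2)

def rk4_step(c, dt, *params):
    k1 = rhs(c, *params)
    k2 = rhs((c[0] + 0.5 * dt * k1[0], c[1] + 0.5 * dt * k1[1]), *params)
    k3 = rhs((c[0] + 0.5 * dt * k2[0], c[1] + 0.5 * dt * k2[1]), *params)
    k4 = rhs((c[0] + dt * k3[0], c[1] + dt * k3[1]), *params)
    return (c[0] + dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0,
            c[1] + dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0)

params = two_mode_parameters(R=700.0, P=1.0, k=math.pi / math.sqrt(2.0))
lam1, lam2, coupling, damping = params
c = (0.01, 0.0)                      # small initial disturbance
for _ in range(12000):               # integrate to t = 60 with dt = 0.005
    c = rk4_step(c, 0.005, *params)

c2_star = -lam1 / coupling           # nonzero fixed point of the flow
c1_star = math.sqrt(lam2 * c2_star / damping)
print(c, (c1_star, c2_star))
```

For R slightly above the threshold the small disturbance grows and then saturates at the nonzero fixed point, which corresponds to steady convection rolls of finite amplitude.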
Prob. 7.2 Carry through the three-normal mode approximation

$\Phi(x,t) = c_1(t)\,g_1(x) + c_2(t)\,g_2(x) + c_3(t)\,g_3(x)$

with $g_3 = g^-_{1k}$ and $g_1$, $g_2$ as before.

This latter flow in $\mathbb{R}^3$ is equivalent to the 3-Galerkin mode approximation of Lorenz (1963)
for $k = \pi/\sqrt{2}$, as can be verified.

7.6  Cautionary  Remarks 

The short survey of the general Galerkin method in section 7.4 is deceptively simple. The
real work is concealed in the equations (7-6) and (7-12). The reduction to normal form (7-6)
depends on which fields one wants to isolate; the others are eliminated at the expense
of raising the differential order. For example, (7-5a) is fourth order, while the original
equations are only second order. When 3D motion is considered, the representation

$\mathbf{v} = \mathbf{v}_{pol} + \mathbf{v}_{tor} = \nabla\times(\nabla\times\mathbf{r}\psi) + \nabla\times\mathbf{r}w$ ,  (7-26)

which satisfies the incompressible constraint $\nabla\cdot\mathbf{v} = 0$ identically, is often used. This
already increases the differential order by two, and further manipulation to separate the
equations for the poloidal and toroidal stream functions $\psi$ and $w$ as much as possible
increases it still further.

As to (7-12), it takes considerable stamina to do the algebra which results in the explicit
coefficients $f_\alpha(c)$ when $M$ is not small. Further, this algebra need not be finite, where
finite means a finite number of terms on the right of (7-11) when a finite linear combination
of modes is the input on the left. The resulting infinite sum may also involve integrals. This,
unfortunately, is the case usually encountered; the Rayleigh problem has an unusual simplicity
because of the simple, finite closure properties of trigonometric functions under multiplication.
In fact, the Galerkin method with $m$ growing modes is only practicable when the nonlinear
operator $F$ generates only a few further modes with nonnegligible coefficients $f_\alpha(c)$. It is hard to
say a priori which pde systems and b.c.’s guarantee this criterion. But the reader should
not forget that we are confronting real nonlinear motion laws head on by the Galerkin
method. Neat, simple solutions cannot be expected.

Working  out  probs.  7.1  and  7.2  will  give  the  serious  reader  an  understanding  of  the  general 
Galerkin  method.  There  is  no  substitute  for  the  actual  labor  of  applying  the  nonlinear 
operator  to  a  sum  of  normal  modes,  producing  a  general  Galerkin  vector,  then  reexpressing 
this  as  a  sum  of  normal  modes. 

7.7  Patterns  of  Convection 

We look at the pattern of convection for our simple Rayleigh problem. By (7-4)

$v_z = \psi_x$ , $v_x = -\psi_z$ .  (7-27)

Since we have 2D motion with no $y$-dependence, the convection must be in the form of
“$y$-rolls,” with axes parallel to the $y$-axis. For a single mode we get

$v_z \propto \sin n\pi z\,\cos kx$ , $v_x \propto -k^{-1}n\pi\,\cos n\pi z\,\sin kx$  (7-28)

from (7-19a). The velocity nodes ($\mathbf{v} = 0$) are at the lattice points

$(kx, n\pi z) = (p\,\pi/2,\ q\,\pi/2)$ ; $p$, $q$ integers, both even or both odd.  (7-29)

Further, the stability of these velocity fixed points is

$p$, $q$ both odd $\Rightarrow$ center ; $p$, $q$ both even $\Rightarrow$ saddle point.  (7-30)

This is most easily seen by drawing velocity vectors along the points of the horizontal and
vertical lines through lattice points (7-29), using (7-28). The streamlines of these rolls fill
out the squares; they are not circles around the centers. The boundaries of the squares
are streamlines, the separatrices of the saddle points. See Fig. 7.2.
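
The classification (7-30) can also be read off from the velocity gradient at each node. For incompressible 2D flow that matrix is traceless, so the sign of its determinant separates centers from saddles. The sketch below takes the single mode ψ = sin nπz sin kx with unit amplitude; the values n = 1 and k = 2 are arbitrary illustrative choices.

```python
import math

def velocity_gradient_det(p, q, n=1, k=2.0):
    # Node at (k x, n pi z) = (p pi/2, q pi/2) for psi = sin(n pi z) sin(k x),
    # with v_z = psi_x and v_x = -psi_z.
    X, Z = p * math.pi / 2.0, q * math.pi / 2.0
    a = n * math.pi
    dvx_dx = -a * k * math.cos(Z) * math.cos(X)
    dvx_dz = a * a * math.sin(Z) * math.sin(X)
    dvz_dx = -k * k * math.sin(Z) * math.sin(X)
    dvz_dz = a * k * math.cos(Z) * math.cos(X)
    return dvx_dx * dvz_dz - dvx_dz * dvz_dx

def classify(p, q):
    # Traceless gradient: det > 0 gives imaginary eigenvalues (center),
    # det < 0 gives real eigenvalues of opposite sign (saddle).
    return "center" if velocity_gradient_det(p, q) > 0 else "saddle"

for p, q in [(1, 1), (3, 1), (0, 0), (2, 0), (2, 2)]:
    print((p, q), classify(p, q))
```

Nodes with both indices odd sit at the extrema of ψ in the middle of each roll, while nodes with both indices even sit at the corners of the squares where ψ vanishes.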
8. EXPERIMENTAL REALIZATIONS OF NONLINEAR DYNAMICS
8.1  Introduction 

So far we have seen several theoretical “paths to chaos.” There is the period-doubling route
$2^n$-cycles $\to$ “chaos” as $n \to \infty$ in 1D maps, caused by increasing the parameter. Then
there is the route $n$-tori $\to$ KAM tori plus stochastic layers (“chaos”) from the break-up of
resonance zones in Hamiltonian dynamics. This is caused by increasing the strength of a
nonintegrable perturbation to an integrable Hamiltonian. Then there is the intermittency
route: periodic motion interrupted by bursts of “chaotic” behavior, which approaches a
fully “chaotic” regime as some parameter is varied. All of these routes have been seen
experimentally. We describe a few of these experiments briefly in this chapter.

We  shall  discuss  pde  systems  in  the  rest  of  this  section,  focusing  on  incompressible  viscous 
fluid  flow.  To  see  the  various  stable  regimes,  the  bifurcations  connecting  them,  the 
associated  universal  numbers,  etc.,  it  is  usually  sufficient  to  measure  any  of  the  physical 
fields,  say  vx,  vy,  vz,  or  T,  at  a  fixed  point  in  space  as  a  function  of  time. 

This  brings  up  the  question:  what  is  the  connection  of  the  actual  fluid  velocity  and 
temperature  distributions  in  3D  configuration  space  and  the  mathematical  image  of  it,  the 
$M$ functions of time $c_1(t), c_2(t), \cdots, c_M(t)$ which we study in a Galerkin approximation?
This is determined of course by the expansion

$\Phi(x,t) = \sum_i c_i(t)\,g_i(x)$  (8-1)

of the fields in normal modes $g_i$. From that we can understand the following correspondences.

a. A stable fixed point $c^*$ of the flow

$\dot c_i = \lambda_i c_i + f_i(c)$ ,  (8-2)

$\dot c_i^* = 0$, all $i$, corresponds to a stable stationary fluid motion (and temperature distribution
understood hereafter), typically convection rolls.

b. If the flow (8-2) shows a stable periodic orbit (limit cycle) $c_i(t + T) = c_i(t)$, all
$t$, then the fluid motion becomes periodic with period $T$, a regime called oscillatory. (A
critical Reynolds number $Re = vL/\nu$ governs the onset of oscillatory motion. Here $v$ =
characteristic velocity, and $L$ and $\nu$ were defined in chap. 7.) Period-doubling in the flow
at a bifurcation value of the Rayleigh number leads to a bifurcation in the fluid motion to
an oscillatory motion of period $2T$. If, finally, the flow enters a “chaotic” regime, say it is
chaotic in the precise sense of definition (5-5), the fluid motion is correspondingly chaotic.
One could in principle measure the sensitive dependence of the fluid quantities, say $T(\mathbf{r},t)$,
by varying the pde initial conditions slightly.

c. On fixing the wave number $k$ by experimental techniques, the following is
presumably the basic idea. Experiments use small rectangular parallelepipedal cells to
approximate the infinite horizontal layer of theory (section 7.3). See Fig. 8.1.
Fig. 8.1 Experimental “rectangular” cell of height $L$ and $x$-width $L_x$


First, $L_x \gg L_y$ is desirable. Second, if an integral number $l$ of identical convection rolls
form, one has $kL_x = l\pi$ (see Fig. 7.2), which fixes $k$.

8.2  Some  Specific  Experiments 
8.2.1  Pde  Systems 


a. Libchaber and Maurer (1982). Liquid $^4$He in a small cell, whose top and bottom
were highly conducting plates. Two bolometers imbedded in the upper plate give local
temperature readings. The cell dimension “favored” two or three convection rolls. The
Prandtl number was small, $0 < P < 1$. Temperatures were in the range 2.5-4.5 K. The
temperature difference $\Delta T_0 = AL$ (cf. (7-2)) was only of the order of mK for the onset of
convection.


As the Rayleigh number was increased above the regime of stable stationary convection,
they saw: first, oscillatory motion of a frequency $\nu_1$ (which increased linearly with $R$),
then the appearance of a second, incommensurable frequency $\nu_2$, then frequency locking
$\nu_2 \to \nu_1/2$, then a cascade of subharmonics (period-doubling) $\nu_1/2, \nu_1/4, \nu_1/8, \nu_1/16, \cdots$
resolved up to $\nu_1/16$. These are all obtained from the signal $T_1(t)$ of one of the bolometers.
Other regimes were also seen, which we skip here.


Frequency locking is a phenomenon in nonlinear oscillators whereby, for two close enough
frequencies $\nu_1$, $\nu_2$ of the motion, $\nu_2$ suddenly jumps to a subharmonic of $\nu_1$. The reader is
referred to Bak (1986).


Let $R_n$ be the period-doubling bifurcation values of the Rayleigh number. It was found
that

$\delta = \lim_{n\to\infty} \frac{R_n - R_{n-1}}{R_{n+1} - R_n} \approx 3.5$  (8-3)

to the observed resolution. Cf. $\delta = 4.6692\cdots$ for the corresponding logistic map ratio.
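
For comparison, the same ratio can be formed from the first few period-doubling thresholds of the logistic map x → r x (1 - x). In this sketch the first two thresholds are exact (3 and 1 + √6), while the later ones are commonly quoted rounded values supplied here for illustration; they are not taken from the experiments above. The function name `doubling_ratios` is illustrative.

```python
import math

def doubling_ratios(thresholds):
    # (R_n - R_{n-1}) / (R_{n+1} - R_n) for each interior n, cf. (8-3).
    return [(thresholds[i] - thresholds[i - 1]) / (thresholds[i + 1] - thresholds[i])
            for i in range(1, len(thresholds) - 1)]

# Logistic-map thresholds: first two exact, the rest rounded literature values.
logistic_thresholds = [3.0, 1.0 + math.sqrt(6.0), 3.544090, 3.564407, 3.568759]
ratios = doubling_ratios(logistic_thresholds)
print([round(r, 3) for r in ratios])
```

Even with only five thresholds the successive ratios already lie close to 4.67, which shows why a cascade resolved to a few doublings, as in the experiments, can give a meaningful estimate of δ.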

b. Libchaber, Laroche, and Fauve (1982). Liquid mercury in a small cell with top
and bottom plates of copper. A single bolometer was imbedded in the bottom plate.
The aspect ratio of the cell “favored” four convection rolls. The rolls were stabilized
by a uniform magnetic field parallel to their axes. (We remark that this goes beyond the
system (7-1) into MHD, or magnetohydrodynamics. MHD is the proper dynamics for some
extremely important and interesting areas, for example, plasma physics, stellar interiors,
the geodynamo (source of the Earth’s magnetic field).)

As the Rayleigh number was raised beyond the regime of stable convection rolls, oscillatory
motion set in. Then a subharmonic cascade $\nu_1, \nu_1/2, \nu_1/4, \nu_1/8, \nu_1/16$ was seen in the
reading $T(t)$ of the bolometer. See Fig. 8.2 for the time series $T(t)$ and its power spectrum.
The number $\delta$ of (8-3) was observed to be $\approx 4.4$, nearer the universal value of the logistic
map.


Fig. 8.2a Time series $T(t)$ of the temperature for various Rayleigh numbers $R$. $R_c$ is our
$R_1$ = absolute minimum of the $R_{nk}$ (see chap. 7). (From Libchaber, Laroche, and Fauve
(1982), with permission)


For  the  theory  of  the  complicated  set  of  stable  regimes  and  their  bifurcations,  see  Busse 
(1978).  For  the  experiments  summarized  in  this  chapter,  see  Cvitanovic  (1984,  part  2). 
From  now  on  our  descriptions  will  be  briefer. 

c.  Gollub  and  Swinney  (1975).  They  measured  a  fluid  velocity  component  at  a  fixed 
point  in  a  rotating  fluid.  The  system  is  an  incompressible  viscous  fluid,  but  is  not  exactly 
that  of  the  convective  system  (7-1). 
Fig. 8.2b Power spectrum of $T(t)$. Arrows indicate the peak at $\nu_1$. (From Libchaber,
Laroche, and Fauve (1982), with permission)

d.  Berge  et  al.  (1988).  They  measured  vz  locally  and  an  averaged  V*T  in  silicone 
oil, where the Prandtl number $P$ is large, $P \approx 130$. They found evidence of intermittency
(section 3.10.3).

8.2.2  Other  Systems 

e.  Hudson  and  Mankin  (1981).  A  chemical  system,  the  Belousov-Zhabotinskii 
reaction.  They  measured  the  concentration  of  one  chemical  species  by  an  indirect  method. 
The  system  here  is  an  actual  finite-dimensional  flow,  cf.  the  simple  Brusselator  model  of 
section  2.7.1. 

f. Arecchi et al. (1982). A quantum optical system, a periodically perturbed CO$_2$
laser. Mathematically, the system is a nonautonomous flow in $\mathbb{R}^2$ (thus an autonomous
system in one higher dimension).

g. Testa et al. (1982). An electronic system, a driven nonlinear oscillator. This
is a nonautonomous flow in $\mathbb{R}^2$. Electronic systems, with their precise controls and
measurements, allow the cleanest tests of the theory.

h.  Guevara,  Glass,  and  Shrier  (1981).  Living  matter  system  (!),  cardiac  cells  perturbed 
by  electrical  current  pulses. 
APPENDIX A

GENERAL  LINEAR  STABILITY  ANALYSIS 

We consider the linear flow $\dot x = Ax$, $x \in \mathbb{R}^N$, $A$ = real $N \times N$ matrix. In the general
case the $N$ proper eigenvectors $e_i$ of $A$, see (2-4), are not linearly independent (= do not
span $\mathbb{R}^N$). A simple example in $\mathbb{R}^2$ is $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$. The double eigenvalue $\lambda = 1$ has only one
linearly independent eigenvector $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$. However, in any case we can introduce the generalized
eigenvectors, which do span $\mathbb{R}^N$. The vector $f$ is called a generalized eigenvector of $A$ to
eigenvalue $\lambda$ if

$(A - \lambda 1)^p f = 0$ for some integer $p$, $1 \le p \le N$.  (A-1)

For $p = 1$, $f$ reduces to an ordinary, or proper, eigenvector.
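
Definition (A-1) can be checked by hand on a 2 x 2 matrix with a double eigenvalue. The sketch below assumes the example matrix with rows (1, 1) and (0, 1); the helper names `mat_vec` and `shifted` are illustrative.

```python
def mat_vec(M, v):
    # Matrix-vector product for small lists of lists.
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def shifted(A, lam):
    # A - lam * 1
    n = len(A)
    return [[A[i][j] - (lam if i == j else 0) for j in range(n)] for i in range(n)]

A = [[1, 1],
     [0, 1]]          # ASSUMED 2 x 2 example: double eigenvalue 1
lam = 1
B = shifted(A, lam)

e = [1, 0]            # proper eigenvector: (A - 1) e = 0, so p = 1
f = [0, 1]            # generalized eigenvector: (A - 1) f != 0, (A - 1)^2 f = 0, so p = 2

print(mat_vec(B, e), mat_vec(B, f), mat_vec(B, mat_vec(B, f)))
# prints [0, 0] [1, 0] [0, 0]
```

The vectors e and f together span the plane, although the matrix has only one linearly independent proper eigenvector.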

Let $f_i$, $i = 1, 2, \cdots, N$, be the linearly independent generalized eigenvectors. For matrix
$A = (a_{ij})$, say they have components $f_{ij}$, that is, $f_{ij}$ is the $j$th component of vector $f_i$.
Then $A$ can be put into Jordan canonical form by the similarity transformation $T^{-1}AT$,
by using the matrix $T$ whose $ij$th matrix element is $T_{ij} = f_{ji}$. That is, the vector $f_i$ forms
the $i$th column of the matrix $T$. If the $f_i$ are properly ordered, $A$ takes Jordan canonical
form, namely $A$ is in block form

$A = \begin{pmatrix} B_1 & & & 0 \\ & B_2 & & \\ & & \ddots & \\ 0 & & & B_q \end{pmatrix}$  (A-2)

with submatrices $B_k$ along the diagonal, where each $B_k$ has the form

$\begin{pmatrix} \lambda & 1 & & 0 \\ & \lambda & 1 & \\ & & \ddots & 1 \\ 0 & & & \lambda \end{pmatrix}$ .  (A-3)

That is, a single eigenvalue $\lambda$ on the main diagonal, 1’s along the main superdiagonal, and
zeros elsewhere. The reader can easily check that this includes the case of $A$ diagonalizable.
Then the $B_k$ are all $1 \times 1$, simply the eigenvalues $\lambda_k$ of $A$. It is also clear that if all the $N$
eigenvalues $\lambda_i$ are different, $A$ must be diagonalizable.

To  test  your  understanding  of  this,  try  to  do  the  following  exercises. 

Ex. A.1

a. Given a generalized eigenvector $f$, let $p$ be the minimal exponent for which
(A-1) holds. Prove that the vectors $f$, $(A - \lambda 1)f$, $(A - \lambda 1)^2 f$, $\cdots$, $(A - \lambda 1)^{p-1}f$ are linearly
independent.

b. If $p = N$ in part (a), we have $N$ linearly independent vectors $e_i = (A - \lambda 1)^i f$,
$i = 0, 1, 2, \cdots, N - 1$. Show that in this case, if the $e_i$ are taken as basis elements, $A$ has
the form (A-3) of a single Jordan block. (Hint: remember the relation

$Ae_i = \sum_{j=1}^{N} a_{ji}\,e_j$ , $i = 1, 2, \cdots, N$

defining the matrix elements $a_{ji}$ of a linear operator $A$.)
APPENDIX B

PROOF  OF  NO  SD  IN  THE  QUANTUM-MECHANICAL  SELF-CORRELATION 

This map of the state vector is defined

$$
C(t,\tau) = \langle \psi(t) | \psi(t+\tau) \rangle , \quad t = \text{time} , \quad \tau \ \text{a const.} \tag{B-1}
$$

This is a complex number; the corresponding metric $d$ is the absolute value of the difference.
Thus, given $\|\psi' - \psi\| < \delta$ small, we want to prove

$$
\Delta C = \langle \psi'(t) | \psi'(t+\tau) \rangle - \langle \psi(t) | \psi(t+\tau) \rangle \tag{B-2}
$$

always small, where $\psi(0) = \psi$, $\psi'(0) = \psi'$, and both are normalized.

Now $\psi(t) = U(t,0)\,\psi(0)$, $\psi(t+\tau) = U(t+\tau,0)\,\psi(0)$, and similarly for $\psi'(t)$ and $\psi'(t+\tau)$,
where $U$ is the unitary time-evolution operator. Here we place no restriction on the QM
system: $\mathcal{H}$ has any dimension, including $\infty$; the energy operator is any whatsoever, and
may depend explicitly on $t$. We can thus rewrite

$$
\langle \psi'(t) | \psi'(t+\tau) \rangle = \langle \psi' | \mathcal{U}(t,\tau) | \psi' \rangle , \tag{B-3}
$$

where $\mathcal{U}(t,\tau) = U^{-1}(t)\,U(t+\tau)$, and the same for $\psi$.

Now we need a convenient representation for any $\psi'$ close to $\psi$, such that $\|\psi' - \psi\| = \delta_1$
say, $0 < \delta_1 < \delta$. This is

$$
\psi' = N^{-1}(\psi + \delta_1 \chi), \quad \text{where } \|\chi\| = 1, \ \langle \chi | \psi \rangle = 0 , \qquad N = (1 + \delta_1^2)^{1/2}, \tag{B-4}
$$

as the reader can verify. We repeat: any state $\psi'$ in the $\delta$-neighborhood $\|\psi' - \psi\| < \delta$ can
be represented as in (B-4) for some unit state $\chi$ normal to $\psi$ and some $\delta_1 < \delta$.

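Put (B-4) into (B-3). We get

$$
\langle \psi' | \mathcal{U} | \psi' \rangle = N^{-2} \left[ \langle \psi | \mathcal{U} | \psi \rangle + \delta_1^2 \langle \chi | \mathcal{U} | \chi \rangle + 2 \delta_1 \, \mathrm{Re} \langle \chi | \mathcal{U} | \psi \rangle \right] . \tag{B-5}
$$

This is then substituted into $\Delta C$, equation (B-2). Now we use some elementary inequalities
on complex numbers and the Schwarz inequality in QM:

$$
|\langle a | b \rangle| \le \|a\| \, \|b\| , \tag{B-6}
$$

where $\langle a | b \rangle$ is the scalar product of two vectors in $\mathcal{H}$ and $\|a\|^2 = \langle a | a \rangle$, etc. We get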

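$$
\begin{aligned}
|\Delta C| &= \left| (N^{-2} - 1) \langle \psi | \mathcal{U} | \psi \rangle + N^{-2} \delta_1^2 \langle \chi | \mathcal{U} | \chi \rangle + 2 N^{-2} \delta_1 \, \mathrm{Re} \langle \chi | \mathcal{U} | \psi \rangle \right| \\
&\le |N^{-2} - 1| \, |\langle \psi | \mathcal{U} | \psi \rangle| + N^{-2} \delta_1^2 \, |\langle \chi | \mathcal{U} | \chi \rangle| + 2 N^{-2} \delta_1 \, |\langle \chi | \mathcal{U} | \psi \rangle| \\
&\le \frac{2 \delta_1^2}{1 + \delta_1^2} + 2 N^{-2} \delta_1 = \frac{2 \delta_1 (1 + \delta_1)}{1 + \delta_1^2} < 2 \delta (1 + \delta) , \quad \text{all } t \ (\text{and all } \tau).
\end{aligned} \tag{B-7}
$$

Here the Schwarz inequality gave $|\langle \chi | \mathcal{U} | \psi \rangle| \le \|\chi\| \cdot \|\mathcal{U} \psi\| = \|\psi\| = 1$ since $\|\chi\| = \|\psi\| = 1$
and the unitary $\mathcal{U}$ does not change the norm. Similarly for the other terms.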

Now the formal proof that $C(t,\tau)$ is Lyapunov stable at any initial state $\psi$, thus has no
SD (5-2), is obvious, but let us carry it through to the end for clarity. Refer to Def. (5-6).
Given an $\epsilon > 0$. Then for $\|\psi' - \psi\| < \delta(\epsilon)$, where we take $\delta(\epsilon)$ to satisfy

$$
2 \delta (1 + \delta) < \epsilon , \tag{B-8}
$$

we have $|\Delta C| < 2 \delta (1 + \delta) < \epsilon$, all $t$. Q.E.D.
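
The bound can be spot-checked numerically in a finite-dimensional space. The sketch below is an illustration, not part of the proof: it assumes NumPy, a Hilbert space of dimension 6, and a randomly drawn unitary matrix standing in for $U^{-1}(t)\,U(t+\tau)$. The helper names (`random_unit`, `random_unitary`, `delta_C`) are hypothetical. It builds $\psi'$ as in (B-4) and compares $|\Delta C|$ with $2\delta_1(1+\delta_1)/(1+\delta_1^2)$.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_unit(dim):
    """Random normalized complex vector."""
    v = rng.normal(size=dim) + 1j * rng.normal(size=dim)
    return v / np.linalg.norm(v)

def random_unitary(dim):
    """The Q factor of a complex Gaussian matrix is unitary."""
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    q, _ = np.linalg.qr(g)
    return q

def delta_C(psi, chi, U, d1):
    """<psi'|U|psi'> - <psi|U|psi> with psi' = (psi + d1*chi)/sqrt(1 + d1**2)."""
    psi_p = (psi + d1 * chi) / np.sqrt(1.0 + d1**2)
    return np.vdot(psi_p, U @ psi_p) - np.vdot(psi, U @ psi)

dim, worst = 6, 0.0
for _ in range(2000):
    psi = random_unit(dim)
    chi = random_unit(dim)
    chi = chi - np.vdot(psi, chi) * psi        # remove the component along psi
    chi /= np.linalg.norm(chi)                 # unit vector normal to psi
    d1 = rng.uniform(1e-3, 0.5)
    U = random_unitary(dim)
    bound = 2 * d1 * (1 + d1) / (1 + d1**2)
    worst = max(worst, abs(delta_C(psi, chi, U, d1)) / bound)

print(worst <= 1.0 + 1e-12)                    # True: the bound is never exceeded
```

The ratio `worst` stays below 1 for every sample, whatever unitary is drawn, which is the content of the statement "all $t$ (and all $\tau$)".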


GLOSSARY  OF  PRINCIPAL  TERMS


The terms marked with * are defined for both flows (continuous time systems) and map
dynamics  (discrete  time  systems).  In  this  glossary,  only  the  definitions  for  flows  are  given. 
The emendations needed to make the definitions apply to discrete time systems can be
found by consulting Chapter 3, especially sections 3.2 - 3.5.

algorithmic complexity (AC) - The algorithmic complexity $K(s_N)$ of the finite word
$s_N$ is the length (number of symbols) of the shortest algorithm which will print the word.

analytic function - A real-valued function $f$ of real variables $x \in \mathbb{R}^N$ is analytic in a
neighborhood $V \subset \mathbb{R}^N$ if $f(x)$ admits a power series expansion in $x$ about every point
$x_0 \in V$.

asymptotically stable* - The fixed point $x^*$ is asymptotically stable if it is stable and
for every neighborhood $V \subset U$ of $x^*$ a neighborhood $V_1 \subset V$ of $x^*$ exists such that
$\phi_t(x_0) \to x^*$, $t \to +\infty$, for every $x_0 \in V_1$.

attracting set - A closed and invariant set $A$ is called an attracting set if $\exists$ a neighborhood
$V$ of $A$ such that $\phi_t(x) \in V$ for $t > 0$ and $\phi_t(x) \to A$, $t \to +\infty$, for all $x \in V$. This definition
can be strengthened by requiring some extra properties such as indecomposability ($A$
contains a dense orbit), generalized dimension in some range, "chaotic" flow, etc., and
then such sets are called attractors or strange attractors.

attractor  -  see  attracting  set. 

autonomous flow - If the defining equations of a flow $\dot{x} = f(x,t) = f(x)$ do not depend
explicitly on $t$, the flow is said to be autonomous.

bifurcation* - The sudden qualitative change in a flow caused by an arbitrarily small
change in its velocity field $f(x)$, that is, such that the flows before and after the bifurcation
cannot be continuously deformed into each other. More precisely, the parameter value $\mu_b$
of the flow $\dot{x} = f_\mu(x)$ is called a bifurcation value if the flow is not structurally stable
at $\mu = \mu_b$. This means that there exist arbitrarily small perturbations $\delta_1 f(x)$, $\delta_2 f(x)$ of
$f_{\mu_b}(x)$ such that the perturbed flow $\dot{x} = f_{\mu_b}(x) + \delta_1 f(x)$ is not topologically equivalent to
the perturbed flow $\dot{x} = f_{\mu_b}(x) + \delta_2 f(x)$.

cascade  -  See  map  dynamics. 

center* - A fixed point $x^*$ is a center if it is stable but not asymptotically stable.

center subspace ($E^c$)* - A center subspace is the subspace of the phase space for a
linear autonomous flow $\dot{x} = Ax$ (where $A$ is a real $N \times N$ matrix) which is spanned by
the subset $\{w_1, w_2, \ldots, w_{N_c}\}$ of eigenvectors of $A$ such that $\mathrm{Re}\,\lambda_i = 0$.

dissipative system - An autonomous flow $\dot{x} = f(x)$ in $\mathbb{R}^N$ is called dissipative if
$\nabla \cdot f = \sum_i \partial f_i / \partial x_i < 0$ there. Since $f(x)$ is the velocity field of the flow in phase space, this negative
divergence implies that comoving volume elements are shrinking in time. Hence a nonzero
volume of $\mathbb{R}^N$ asymptotically shrinks to volume 0 under the flow.

finite word - A finite word $s_N$ is a string of $N$ letters from some alphabet $\mathcal{L}$, for example
$\mathcal{L} = \{1, 2, \ldots, m\}$. A special case is $\mathcal{L} = \{0, 1\}$, the binary alphabet.
fixed point ($x^*$)* - A fixed point $x^*$ of a flow $\dot{x} = f(x)$ is defined by $\dot{x}^* = f(x^*) = 0$.

flow - A system of $N$ first order ordinary differential equations in time $t$,

$$
\dot{x} = f(x,t), \quad \dot{x} \equiv \frac{dx}{dt}, \quad x \in \mathbb{R}^N ,
$$

defines a flow.

Hamilton-Jacobi (H-J) equation - The Hamilton-Jacobi equation is the single partial
differential equation

$$
H(q, \partial W / \partial q) = E = \text{const.},
$$

for Hamilton's characteristic function $W$.

Hamiltonian system - The flow

$$
\dot{q} = \partial H / \partial p, \quad \dot{p} = -\partial H / \partial q, \quad q, p \in \mathbb{R}^n ,
$$

in $\mathbb{R}^N$, $N = 2n$, is a Hamiltonian system of $n$ degrees of freedom. $H = H(q,p,t)$ is
called the Hamiltonian. If $H$ does not explicitly depend on $t$, the flow is autonomous, and
$H(q(t),p(t))$ is a constant (or integral) of the motion. In other words, $H$ is conserved.

homeomorphism - A homeomorphism $h : U \to V$ maps open set $U$ into open set $V$ such
that $h$ is continuous and $h^{-1}$ exists and is continuous.

hyperbolic* - A fixed point $x^*$ is called hyperbolic (sometimes nondegenerate) if $Df(x^*)$
has no eigenvalue $\lambda$ with $\mathrm{Re}\,\lambda = 0$, where $Df(x^*)$ is the Jacobian matrix evaluated at the
fixed point. That is, if $E^c = 0$, the zero vector.

integrable Hamiltonian system - There are three ways to define an integrable Hamiltonian
system:

1) A Hamiltonian system is integrable if the solution curves
$z(t) = (q(q_0,p_0,t), p(q_0,p_0,t))$ are analytic in $z_0 = (q_0,p_0)$ and in $t$.

2) A Hamiltonian system is integrable if there exist $n$ independent analytic constants
of the motion $F_i(q(t),p(t)) = \text{const.}$, $i = 1, 2, \ldots, n$.

3) A Hamiltonian system is integrable if its Hamilton-Jacobi equation admits a complete
solution.

invariant set - Set $S$ is invariant if $\phi_t(S) \subset S$, $-\infty < t < \infty$.

local bifurcation* - A local bifurcation at the fixed point $x^*$ of the flow $\dot{x} = f_\mu(x)$ occurs
when an eigenvalue $\lambda$ of the Jacobian matrix $Df_\mu(x^*)$ crosses the imaginary axis in the
complex eigenvalue plane. If $\mathrm{Re}\,\lambda(\mu_b) = 0$ and this zero is isolated, $\mu_b$ is called a local
bifurcation value.

manifold - A manifold is a "continuous" set like a curve, surface, volume, etc., or
technically: locally just like $\mathbb{R}^M$ for some positive integer $M$.
map dynamics - Let $F$ be a map from $\mathbb{R}^N$ into $\mathbb{R}^N$ which is continuous with continuous
first derivatives, written $F : \mathbb{R}^N \to \mathbb{R}^N$, $F \in C^1$. $F$ determines a dynamics in which time
is discrete by iteration, namely

$$
x_{n+1} = F(x_n) , \quad n = \text{integer} .
$$

A map dynamics is also called a cascade.

phase space* - The space of all states of a system. For a flow in $\mathbb{R}^N$, phase space is $\mathbb{R}^N$
or that subset $U \subset \mathbb{R}^N$ in which the flow is confined.


saddle point* - A hyperbolic fixed point $x^*$ is called a saddle point if the stable subspace
$E^s$ and the unstable subspace $E^u$ of the phase space are both $\neq 0$, the zero vector.

sensitive dependence (SD) on initial conditions - The dynamics $D$ with orbit
function $\phi_t$ and metric $d(x',x)$ has sensitive dependence on initial conditions at state
$x$ if there exists an $\epsilon > 0$ such that, for any neighborhood $\mathcal{N}$ of $x$ there exists an $x' \in \mathcal{N}$
and a $t > 0$ such that $d(\phi_t(x'), \phi_t(x)) > \epsilon$.

sink*  -  A  fixed  point  x*  is  called  a  sink  if  it  is  asymptotically  stable. 

source* - A hyperbolic fixed point $x^*$ is called a source if the stable subspace $E^s$ of the
phase space $= 0$, the zero vector.

stable fixed point* - A fixed point $x^*$ is stable in $U$ if for every neighborhood $V \subset U$
of $x^*$ there is a neighborhood $V_1 \subset V$ of $x^*$ such that every solution $x(t) = \phi_t(x_0)$ with
$x_0 \in V_1$ is defined and $\in V$ for all $t > 0$.


stable manifold ($W^s$)* - The stable manifold $W^s$ of a hyperbolic fixed point $x^*$ is the
set of all those points which converge to $x^*$ under the flow. That is, all $x \in$ phase space
such that $\phi_t(x) \to x^*$, $t \to +\infty$.

stable subspace ($E^s$)* - A stable subspace $E^s$ is the subspace of the phase space for a
linear autonomous flow $\dot{x} = Ax$ (where $A$ is a real $N \times N$ matrix), which is spanned by
the subset $\{u_1, u_2, \ldots, u_{N_s}\}$ of eigenvectors of $A$ such that $\mathrm{Re}\,\lambda_i < 0$.

strange  attractor  -  see  attracting  set. 

structurally  stable  -  see  bifurcation  value. 


topologically equivalent* - The $f$-flow and the $g$-flow in sets $X$ and $Y$ respectively are
topologically equivalent if and only if there exists a homeomorphism $h : X \to Y$ such that
for every $t_1$, $h \circ \phi^f_{t_1} = \phi^g_{t_2} \circ h$ for some $t_2$.


topologically transitive - For a dynamics $D$ with orbit function $\phi_t$, $V$ is topologically
transitive if, for any pair of open sets $U_1, U_2 \subset V$, the image of $U_1$ under the dynamics
intersects $U_2$ at some time: $\phi_t(U_1) \cap U_2 \neq \emptyset$ for some time $t > 0$.


unstable fixed point* - The fixed point $x^*$ is unstable in $U$ if it is not stable, that is,
if $\exists$ a neighborhood $V \subset U$ of $x^*$ such that for all neighborhoods $V_1 \subset V$ of $x^*$ $\exists$ an orbit
based at $x_0 \in V_1$ which $\notin V$ for some $t > 0$.

unstable manifold ($W^u$)* - The unstable manifold $W^u$ of $x^*$ is the set of all those points
which "diverge from $x^*$ under the flow" or, more precisely, all those points which converge
to $x^*$ as time runs backwards: $\phi_t(x) \to x^*$, $t \to -\infty$.


unstable subspace ($E^u$)* - An unstable subspace $E^u$ is the subspace of the phase space
for a linear autonomous flow $\dot{x} = Ax$ (where $A$ is a real $N \times N$ matrix), which is spanned
by the subset $\{v_1, v_2, \ldots, v_{N_u}\}$ of eigenvectors of $A$ such that $\mathrm{Re}\,\lambda_i > 0$.
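
Several of the glossary terms for linear flows (stable, unstable, and center subspaces; hyperbolic, sink, source, saddle point) can be tied together in a short sketch. It assumes NumPy; the function name `classify_fixed_point` and the tolerance `tol` are hypothetical choices made for illustration. Subspace dimensions are counted by eigenvalue multiplicity, which agrees with the eigenvector-based definitions above when $A$ is diagonalizable.

```python
import numpy as np

def classify_fixed_point(A, tol=1e-12):
    """Classify the fixed point x* = 0 of the linear flow dx/dt = A x.

    Returns a label and the dimensions (N_s, N_u, N_c) of the stable,
    unstable, and center subspaces, counted by eigenvalue multiplicity.
    """
    re = np.linalg.eigvals(A).real
    n_s = int(np.sum(re < -tol))     # Re(lambda) < 0
    n_u = int(np.sum(re > tol))      # Re(lambda) > 0
    n_c = len(re) - n_s - n_u        # Re(lambda) = 0 within tolerance
    if n_c > 0:
        kind = "nonhyperbolic"       # center subspace is not the zero vector
    elif n_u == 0:
        kind = "sink"
    elif n_s == 0:
        kind = "source"
    else:
        kind = "saddle point"
    return kind, (n_s, n_u, n_c)

print(classify_fixed_point(np.array([[-1.0, 0.0], [0.0, -2.0]])))
print(classify_fixed_point(np.array([[1.0, 0.0], [0.0, -1.0]])))
print(classify_fixed_point(np.array([[0.0, 1.0], [-1.0, 0.0]])))
print(classify_fixed_point(np.array([[2.0, 1.0], [0.0, 3.0]])))
```

For a nonlinear flow the same test would be applied to the Jacobian matrix $Df(x^*)$, and it is only conclusive when the fixed point is hyperbolic.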